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Preface 


This  book  originates  from  lecture  notes  for  an  introductory  course  on  stochastic 
calculus  taught  as  part  of  the  master’s  program  in  probability  and  statistics  at 
Universite  Pierre  et  Marie  Curie  and  then  at  Universite  Paris-Sud.  The  aim  of  this 
course  was  to  provide  a  concise  but  rigorous  introduction  to  the  theory  of  stochastic 
calculus  for  continuous  semimartingales,  putting  a  special  emphasis  on  Brownian 
motion.  This  book  is  intended  for  students  who  already  have  a  good  knowledge 
of  advanced  probability  theory,  including  tools  of  measure  theory  and  the  basic 
properties  of  conditional  expectation.  We  also  assume  some  familiarity  with  the 
notion  of  uniform  integrability  (see,  for  instance,  Chapter  VII  in  Grimmett  and 
Stirzaker  [30]).  For  the  reader’s  convenience,  we  record  in  Appendix  A2  those 
results  concerning  discrete  time  martingales  that  we  use  in  our  study  of  continuous 
time  martingales. 

The  first  chapter  is  a  brief  presentation  of  Gaussian  vectors  and  processes.  The 
main  goal  is  to  arrive  at  the  notion  of  a  Gaussian  white  noise,  which  allows  us  to  give 
a  simple  construction  of  Brownian  motion  in  Chap.  2.  In  this  chapter,  we  discuss 
the  basic  properties  of  sample  paths  of  Brownian  motion  and  the  strong  Markov 
property  with  its  classical  application  to  the  reflection  principle.  Chapter  2  also  gives 
us  the  opportunity  to  introduce,  in  the  relatively  simple  setting  of  Brownian  motion, 
the  important  notions  of  filtrations  and  stopping  times,  which  are  studied  in  a  more 
systematic  and  abstract  way  in  Chap.  3.  The  latter  chapter  discusses  continuous  time 
martingales  and  supermartingales  and  investigates  the  regularity  properties  of  their 
sample  paths.  Special  attention  is  given  to  the  optional  stopping  theorem,  which 
in  connection  with  stochastic  calculus  yields  a  powerful  tool  for  lots  of  explicit 
calculations.  Chapter  4  introduces  continuous  semimartingales,  starting  with  a 
detailed  discussion  of  finite  variation  functions  and  processes.  We  then  discuss  local 
martingales,  but  as  in  most  of  the  remaining  part  of  the  course,  we  restrict  our 
attention  to  the  case  of  continuous  sample  paths.  We  provide  a  detailed  proof  of 
the  key  theorem  on  the  existence  of  the  quadratic  variation  of  a  local  martingale. 
Chapter  5  is  at  the  core  of  this  book,  with  the  construction  of  the  stochastic 
integral  with  respect  to  a  continuous  semimartingale,  the  proof  in  this  setting  of  the 
celebrated  Ito  formula,  and  several  important  applications  (Levy’s  characterization 
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theorem  for  Brownian  motion,  the  Dambis-Dubins-Schwarz  representation  of 
a  continuous  martingale  as  a  time-changed  Brownian  motion,  the  Burkholder- 
Davis-Gundy  inequalities,  the  representation  of  Brownian  martingales  as  stochastic 
integrals,  Girsanov’s  theorem  and  the  Cameron-Martin  formula,  etc.).  Chapter  6, 
which  presents  the  fundamental  ideas  of  the  theory  of  Markov  processes  with 
emphasis  on  the  case  of  Feller  semigroups,  may  appear  as  a  digression  to  our  main 
topic.  The  results  of  this  chapter,  however,  play  an  important  role  in  Chap.  7,  where 
we  combine  tools  of  the  theory  of  Markov  processes  with  techniques  of  stochastic 
calculus  to  investigate  connections  of  Brownian  motion  with  partial  differential 
equations,  including  the  probabilistic  solution  of  the  classical  Dirichlet  problem. 
Chapter  7  also  derives  the  conformal  invariance  of  planar  Brownian  motion  and 
applies  this  property  to  the  skew-product  decomposition,  which  in  turn  leads  to 
asymptotic  laws  such  as  the  celebrated  Spitzer  theorem  on  Brownian  windings. 
Stochastic  differential  equations,  which  are  another  very  important  application  of 
stochastic  calculus  and  in  fact  motivated  Ito’s  invention  of  this  theory,  are  studied 
in  detail  in  Chap.  8,  in  the  case  of  Lipschitz  continuous  coefficients.  Here  again 
the  general  theory  developed  in  Chap.  6  is  used  in  our  study  of  the  Markovian 
properties  of  solutions  of  stochastic  differential  equations,  which  play  a  crucial 
role  in  many  applications.  Finally,  Chap.  9  is  devoted  to  local  times  of  continuous 
semimartingales.  The  construction  of  local  times  in  this  setting,  the  study  of  their 
regularity  properties,  and  the  proof  of  the  density  of  occupation  formula  are  very 
convincing  illustrations  of  the  power  of  stochastic  calculus  techniques.  We  conclude 
Chap.  9  with  a  brief  discussion  of  Brownian  local  times,  including  Trotter’s  theorem 
and  the  famous  Levy  theorem  identifying  the  law  of  the  local  time  process  at  level  0. 

A  number  of  exercises  are  listed  at  the  end  of  every  chapter,  and  we  strongly 
advise  the  reader  to  try  them.  These  exercises  are  especially  numerous  at  the  end 
of  Chap.  5,  because  stochastic  calculus  is  primarily  a  technique,  which  can  only 
be  mastered  by  treating  a  sufficient  number  of  explicit  calculations.  Most  of  the 
exercises  come  from  exam  problems  for  courses  taught  at  Universite  Pierre  et  Marie 
Curie  and  at  Universite  Paris-Sud  or  from  exercise  sessions  accompanying  these 
courses. 

Although  we  say  almost  nothing  about  applications  of  stochastic  calculus  in 
other  fields,  such  as  mathematical  finance,  we  hope  that  this  book  will  provide  a 
strong  theoretical  background  to  the  reader  interested  in  such  applications.  While 
presenting  all  tools  of  stochastic  calculus  in  the  general  setting  of  continuous 
semimartingales,  together  with  some  of  the  most  important  developments  of  the 
theory,  we  have  tried  to  keep  this  text  to  a  reasonable  size,  without  making  any 
concession  to  mathematical  rigor.  The  reader  who  wishes  to  go  further  in  the  theory 
and  applications  of  stochastic  calculus  may  consult  the  classical  books  of  Karatzas 
and  Shreve  [49],  Revuz  and  Yor  [70],  or  Rogers  and  Williams  [72].  For  a  historical 
perspective  on  the  development  of  the  theory,  we  recommend  Ito’s  original  papers 
[41]  and  McKean’s  book  [57],  which  greatly  helped  to  popularize  Ito’s  work.  More 
information  about  stochastic  differential  equations  can  be  found  in  the  books  by 
Stroock  and  Varadhan  [77],  Ikeda  and  Watanabe  [35],  and  0ksendal  [66].  Stochastic 
calculus  for  semimartingales  with  jumps,  which  we  do  not  present  in  this  book,  is 
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treated  in  Jacod  and  Shiryaev  [44]  or  Protter  [63]  and  in  the  classical  treatise  of 
Dellacherie  and  Meyer  [13,  14].  Many  other  references  for  further  reading  appear  in 
the  notes  and  comments  at  the  end  of  every  chapter. 

I  wish  to  thank  all  those  who  attended  my  stochastic  calculus  lectures  in  the  last 
20  years  and  who  contributed  to  this  book  through  their  questions  and  comments. 
I  am  especially  indebted  to  Marc  Yor,  who  left  us  too  soon.  Marc  taught  me  most 
of  what  I  know  about  stochastic  calculus,  and  his  numerous  remarks  helped  me  to 
improve  this  text. 

Orsay,  France  Jean-Frangois  Le  Gall 

January  2016 
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Chapter  1 

Gaussian  Variables  and  Gaussian  Processes 


Gaussian  random  processes  play  an  important  role  both  in  theoretical  probability 
and  in  various  applied  models.  We  start  by  recalling  basic  facts  about  Gaussian  ran¬ 
dom  variables  and  Gaussian  vectors.  We  then  discuss  Gaussian  spaces  and  Gaussian 
processes,  and  we  establish  the  fundamental  properties  concerning  independence 
and  conditioning  in  the  Gaussian  setting.  We  finally  introduce  the  notion  of  a 
Gaussian  white  noise,  which  will  be  used  to  give  a  simple  construction  of  Brownian 
motion  in  the  next  chapter. 


1.1  Gaussian  Random  Variables 

Throughout  this  chapter,  we  deal  with  random  variables  defined  on  a  probability 
space  (X2,  P).  For  some  of  the  existence  statements  that  follow,  this  probability 

space  should  be  chosen  in  an  appropriate  way.  For  every  real  p  >  1,  LP  (£2,  PP ,  P), 
or  simply  LP  if  there  is  no  ambiguity,  denotes  the  space  of  all  real  random  variables 
X  such  that  \X\P  is  integrable,  with  the  usual  convention  that  two  random  variables 
that  are  a.s.  equal  are  identified.  The  space  LP  is  equipped  with  the  usual  norm. 

A  real  random  variable  X  is  said  to  be  a  standard  Gaussian  (or  normal)  variable 
if  its  law  has  density 


1  v2 

Px(x)  =  — =exp(-— ) 
V27T  2 


with  respect  to  Lebesgue  measure  on  R.  The  complex  Laplace  transform  of  X  is 
then  given  by 


E[ezX]  =  C/2,  Vz  €  C. 
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1  Gaussian  Variables  and  Gaussian  Processes 


To  get  this  formula  (and  also  to  verify  that  the  complex  Laplace  transform  is  well 
defined),  consider  first  the  case  when  z  —  A  G  R: 

E[txx]  —  _  f  qXx  q~x^2  dx  —  e2^2  _  f  Q-^-^y/2  dx  =  eA^2. 

\/2tc  Jr  \J2tc  Jr 

This  calculation  ensures  that  E[ezX]  is  well-defined  for  every  z  g  C,  and  defines  a 
holomorphic  function  on  C.  By  analytic  continuation,  the  identity  E[tzX]  —  ez_//2, 
which  is  true  for  every  z  G  R,  must  also  be  true  for  every  z  G  C. 

By  taking  z  =  i£,  £  G  R,  we  get  the  characteristic  function  of  X : 

^[e1^]  =  e“^2/2. 


From  the  expansion 

(\  ty 

E[RX]  =  1  +  i  $E[X\  +  •  •  •  +  Elj-E[Xn]  +  0(  |?r+1), 

n\ 

as  £  ->  0  (this  expansion  holds  for  every  n  >  1  when  X  belongs  to  all  spaces  Lp , 
1  <  p  <  oo,  which  is  the  case  here),  we  get 

£[X]  =  0,  E[X2]  =  1 

and  more  generally,  for  every  integer  n  >  0, 

E[X2n]  =  f .  £’[X2',+1]  =  0. 

2"«! 

If  a  >  0  and  m  G  R,  we  say  that  a  real  random  variable  7  is  Gaussian  with 
jV(m,  a2) -distribution  if  Y  satisfies  any  of  the  three  equivalent  properties: 

(i)  Y  —  aX  +  m,  where  X  is  a  standard  Gaussian  variable  (i.e.  X  follows  the 
yY(0, 1) -distribution); 

(ii)  the  law  of  Y  has  density 


Priy)  = 


1  ( y-m )2 

— 7==  exP - 

o\/2tc  2  a- 


(iii)  the  characteristic  function  of  7  is 


£[ei?y]  =  exp(im£ 


We  have  then 


E[Y]  —  m,  var(7)  =  cr2. 
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By  extension,  we  say  that  Y  is  Gaussian  with  JT(m,  0) -distribution  if  Y  —  m  a.s. 
(property  (iii)  still  holds  in  that  case). 

Sums  of  independent  Gaussian  variables  Suppose  that  Y  follows  the  XT{m,  (re¬ 
distribution,  Y'  follows  the  jE(m! ,  a'2) -distribution,  and  Y  and  Y'  are  independent. 
Then  Y  +  Y'  follows  the  XY(m  +  m! ,  a2  +  a/2)-distribution.  This  is  an  immediate 
consequence  of  (iii). 

Proposition  1.1  Let  {Xn)n>\  be  a  sequence  of  real  random  variables  such  that,  for 
every  n  >  1,  Xn  follows  the  jY(mn,  a2) -distribution.  Suppose  that  Xn  converges  in 
L 2  to  X.  Then: 

(i)  The  random  variable  X  follows  the  XY(m,  a2)- distribution,  where  m  —  lim mn 
and  g  —  lim  an. 

(ii)  The  convergence  also  holds  in  all  LP  spaces,  1  <  p  <  oo. 

Remark  The  assumption  that  Xn  converges  in  L2  to  X  can  be  weakened  to 
convergence  in  probability  (and  in  fact  the  convergence  in  distribution  of  the 
sequence  {Xn)n>\  suffices  to  get  part  (i)).  We  leave  this  as  an  exercise  for  the  reader. 

Proof 

(i)  The  convergence  in  L 2  implies  that  mn  =  E[Xn\  converges  to  E[X]  and  a2  — 
var(Xw)  converges  to  var(X)  as  n  ->  oo.  Then,  setting  m  —  E[X\  and  a2  = 
var(X),  we  have  for  every  £  e  R, 

2  2 

E[el^x]  =  lim  E[el^Xn]  =  lim  exp(i - -  £2)  =  exp(im£ - £2), 

n — >oo  n—>oo  2  2 

showing  that  X  follows  the  a2) -distribution. 

(ii)  Since  Xn  has  the  same  distribution  as  anN  +  mn,  where  is  a  standard  Gaussian 
variable,  and  since  the  sequences  ( mn )  and  ( an )  are  bounded,  we  immediately 
see  that 

supE[\Xn\q]  <  oo,  Xq  >  1. 

n 

It  follows  that 

supE[|Xn  —  X\q]  <  oo,  Wq  >  1. 

n 

Let  p  >  1.  The  sequence  Yn  —  \Xn  —  X\p  converges  in  probability  to  0  and 
is  uniformly  integrable  because  it  is  bounded  in  L 2  (by  the  preceding  bound 
with  q  —  2 p).  It  follows  that  this  sequence  converges  to  0  in  L1,  which  was  the 
desired  result. 

□ 
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1  Gaussian  Variables  and  Gaussian  Processes 


1.2  Gaussian  Vectors 

Let  E  be  a  d-dimensional  Euclidean  space  ( E  is  isomorphic  to  Wl  and  we  may  take 
E  —  M.d,  with  the  usual  inner  product,  but  it  will  be  more  convenient  to  work  with 
an  abstract  space).  We  write  (w,  v)  for  the  inner  product  in  E.  A  random  variable  A 
with  values  in  E  is  called  a  Gaussian  vector  if,  for  every  u  e  E,  (i u,X )  is  a  (real) 
Gaussian  variable.  (For  instance,  if  E  —  Mrf,  and  if  X\ , . . . ,  A^  are  are  independent 
Gaussian  variables,  the  property  of  sums  of  independent  Gaussian  variables  shows 
that  the  random  vector  X  —  (X\ , . . . ,  Xj)  is  a  Gaussian  vector.) 

Let  A  be  a  Gaussian  vector  with  values  in  E.  Then  there  exist  mx  £  E  and  a 
nonnegative  quadratic  form  qx  on  E  such  that,  for  every  u  e  E, 

E[(u,X )]  =  (w,  mx), 
var((w,X))  =  qx(u). 

Indeed,  let  (e\, ... ,  ef)  be  an  orthonormal  basis  on  E ,  and  write  X  —  Y^i=\  Xj ej 
in  this  basis.  Notice  that  the  random  variables  Xj  =  [ej,X)  are  Gaussian.  It  is  then 

immediate  that  the  preceding  formulas  hold  with  mx  —  J2j=  i  E[Xj\  ej  (=')  E[X], 
and,  if  u  —  Y^j=  i  ujej* 


qx(u)  = 


d 

^  ^  UjUk  c o\(Xj,  Xfc) . 

j>k= 1 


Since  ( u ,  A)  follows  the  XE((u,  mx),  qx(u)) -distribution,  we  get  the  characteristic 
function  of  the  random  vector  A, 


£[exp(i(w,A))] 


=  exp(i  (w, 


1 


(1.1) 


Proposition  1.2  Under  the  preceding  assumptions,  the  random  variables 
Ai , . . . ,  A d  are  independent  if  and  only  if  the  covariance  matrix  (cov(Aj,  Xk))\<jx<d 
is  diagonal  or  equivalently  if  and  only  if  qx  is  of  diagonal  form  in  the  basis 
(ei, . . . ,  ef). 

Proof  If  the  random  variables  Ai , . . . ,  A d  are  independent,  the  covariance  matrix 
(cov(Xj,Xk))j,k=i,...d  is  diagonal.  Conversely,  if  this  matrix  is  diagonal,  we  have  for 

every  u  =  J2j=i  ujej  e  E, 


d 

qx(u)  =  22  h  uj  ’ 
7=1 
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where  A j  =  var (Xj).  Consequently,  using  (1.1), 

d  j  d 

=  n  ^p(^jE[Xj]  -  - Xjitf )  =  ]~~[£[exp(i/y;X;)], 

7=1  7=1  7=1 

which  implies  that  X\ , . . . ,  Xj  are  independent.  □ 

With  the  quadratic  form  qx ,  we  associate  the  unique  symmetric  endomorphism 
yx  of  E  such  that 


exp 


(i  E  w 


<?x(w)  =  (m,  Yx(u)) 

(the  matrix  of  yx  in  the  basis  (e\ ,...,ef)  is  (cov(X^,  but  of  course 

the  definition  of  yx  does  not  depend  on  the  choice  of  a  basis).  Note  that  yx  is 
nonnegative  in  the  sense  that  its  eigenvalues  are  all  nonnegative. 

From  now  on,  to  simplify  the  statements,  we  restrict  our  attention  to  centered 
Gaussian  vectors,  i.e.  such  that  mx  —  0,  but  the  following  results  are  easily  adapted 
to  the  non-centered  case. 

Theorem  1.3 

(i)  Let  y  be  a  nonnegative  symmetric  endomorphism  of  E.  Then  there  exists  a 
Gaussian  vector  X  such  that  yx  —  y. 

(ii)  Let  X  be  a  centered  Gaussian  vector.  Let  (s  \, ...  ,£f)  be  a  basis  of  E  in  which 
yx  is  diagonal,  yx£j  =  A j£jfor  every  1  <j<d,  where 

Ai  >  A2  >  •  •  •  >  Ar  >  0  =  Ar+i  =  •  •  •  =  A d 

so  that  r  is  the  rank  of  yx .  Then, 


r 

*  =  E  YJeJ> 

7=1 

where  Yj,  1  <  j  <  r,  are  independent  ( centered )  Gaussian  variables  and 
the  variance  of  Yj  is  Xj.  Consequently,  if  Px  denotes  the  distribution  of  X, 
the  topological  support  of  Px  is  the  vector  space  spanned  by  £\,...,sr. 
Furthermore,  Px  is  absolutely  continuous  with  respect  to  Lebesgue  measure 
on  E  if  and  only  if  r  —  d,  and  in  that  case  the  density  ofX  is 

wW=(2^/VdelfteX 
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Proof 

(i)  Let  (ei, . . . ,  sd)  be  an  orthonormal  basis  of  E  in  which  y  is  diagonal,  y{ej)  — 
A jSj  for  1  <  j  <  d,  and  let  Y\, ...  ,Yd  be  independent  centered  Gaussian 
variables  with  var (Yj)  =  A;-,  1  <j<d.  We  set 

d 

*  =  E  YJeJ- 

j=  1 


Then,  if  u  —  J2j=i  uj£j > 


d  ^  d 

qx(u)  =  E  uJYi)  =  E  XJUJ  =  (M’  r(“)> 

7=1  7=1 


(ii)  Let  F] , . . . ,  L/  be  the  coordinates  of  X  in  the  basis  (ei , ,£d).  Then  the 
matrix  of  yx  in  this  basis  is  the  covariance  matrix  of  Y\, . . . ,  Yd.  The  latter 
covariance  matrix  is  diagonal  and,  by  Proposition  1.2,  the  variables  Y\ , . . . ,  Yd 
are  independent.  Furthermore,  for  j  E  {r  +  1 , . . . ,  d},  we  have  E[Yj]  —  0  hence 
Yj  =  0  a.s. 

Then,  since  X  —  J2j=  i  Yj£j  a-s-’  ^  clear  that  suppPx  is  contained  in  the 
subspace  spanned  by  £\ , . . . ,  sr.  Conversely,  if  O  is  a  rectangle  of  the  form 


O  —  {u  —  djSj  :  cij  <  oij  <  bj ,  VI  <j<  r}, 
7=1 


we  have  P[X  e  O]  —  YYj=i  P[aj  K  Yj  <  bj]  >  0.  This  is  enough  to  get  that  supp  Px 
is  the  subspace  spanned  by  £\ , . . . ,  sr. 

If  r  <  d,  since  the  vector  space  spanned  by  £\ , . . . ,  sr  has  zero  Lebesgue  measure, 
the  distribution  of  X  is  singular  with  respect  to  Lebesgue  measure  on  E.  Suppose 
that  r  —  d,  and  write  Y  for  the  random  vector  in  W1  defined  by  Y  =  (Fi , . . . ,  Yd). 
Note  that  the  bijection  <p(y\,...  ,yd)  =  J2yj£j  maPs  F  to  X.  Then,  writing  y  — 
(yi,...,yd),  we  have 

E[g(X)]  =  E[g(cp(Y))] 


dyi . . .  d yd 
's/ A i  . . .  Xd 


1 

( 2jr)d /2  Vhet  yx 

1 

( 2jt)d /2  Vdet  yx 


dy\  ...  d yd 
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since  Lebesgue  measure  on  E  is  by  definition  the  image  of  Lebesgue  measure  on  W1 
under  (p  (or  under  any  other  vector  isometry  from  W1  onto  E).  In  the  second  equality, 
we  used  the  fact  that  Y\ , . . . ,  Yd  are  independent  Gaussian  variables,  and  in  the  third 
equality  we  observed  that 


d  d 

(<p(y),Yxl(<p(y)))  = 

7=1  7=1 


□ 


1.3  Gaussian  Processes  and  Gaussian  Spaces 

From  now  on  until  the  end  of  this  chapter,  we  consider  only  centered  Gaussian 
variables,  and  we  frequently  omit  the  word  “centered”. 

Definition  1.4  A  (centered)  Gaussian  space  is  a  closed  linear  subspace  of 
l2(,g  ,  SP ,  P )  which  contains  only  centered  Gaussian  variables. 

For  instance,  if  X  =  (Xi , . . . ,  Xf)  is  a  centered  Gaussian  vector  in  W /,  the  vector 
space  spanned  by  {X\ , . . . ,  Xd)  is  a  Gaussian  space. 

Definition  1.5  Let  (E,  S)  be  a  measurable  space,  and  let  T  be  an  arbitrary  index 
set.  A  random  process  (indexed  by  T)  with  values  in  E  is  a  collection  {Xt)teT  of 
random  variables  with  values  in  E.  If  the  measurable  space  ( E ,  S)  is  not  specified, 
we  will  implicitly  assume  that  E  —  R  and  S  —  ^(M)  is  the  Borel  cr-field  on  R. 

Here  and  throughout  this  book,  we  use  the  notation  SS(F)  for  the  Borel  a -field 
on  a  topological  space  F.  Most  of  the  time,  the  index  set  T  will  be  M+  or  another 
interval  of  the  real  line. 

Definition  1.6  A  (real- valued)  random  process  (Xt)teT  is  called  a  (centered)  Gaus¬ 
sian  process  if  any  finite  linear  combination  of  the  variables  Xt,  t  e  T  is  centered 
Gaussian. 

Proposition  1.7  If  ( Xt)teT  is  a  Gaussian  process,  the  closed  linear  sub  space  of 
E 2  spanned  by  the  variables  Xt ,  t  e  T,  is  a  Gaussian  space,  which  is  called  the 
Gaussian  space  generated  by  the  process  X. 

Proof  It  suffices  to  observe  that  an  L2-limit  of  centered  Gaussian  variables  is  still 
centered  Gaussian,  by  Proposition  1.1.  □ 

We  now  turn  to  independence  properties  in  a  Gaussian  space.  We  need  the 
following  definition. 

Definition  1.8  Let  H  be  a  collection  of  random  variables  defined  on  (X2,  &  ,P). 
The  cr-field  generated  by  H ,  denoted  by  cr(//),  is  the  smallest  cr-field  on  £2  such  that 
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all  variables  £  G  H  are  measurable  for  this  a-field.  If  ^  is  a  collection  of  subsets  of 
E2,  we  also  write  a(^)  for  the  smallest  a-field  on  Q  that  contains  all  elements  of 

hg. 


The  next  theorem  shows  that,  in  some  sense,  independence  is  equivalent  to 
orthogonality  in  a  Gaussian  space.  This  is  a  very  particular  property  of  the  Gaussian 
distribution. 

Theorem  1.9  Let  H  be  a  centered  Gaussian  space  and  let  (Z//)/g/  be  a  collection  of 
linear  subspaces  of  H.  Then  the  subspaces  Hi,  i  G  I,  are  (pairwise)  orthogonal  in 
L 2  if  and  only  the  a -fields  a  (Hi),  i  G  I,  are  independent. 

Remark  It  is  crucial  that  the  vector  spaces  Ht  are  subspaces  of  a  common  Gaussian 
space  H.  Consider  for  instance  a  random  variable  X  distributed  according  to 
yK(0, 1)  and  another  random  variable  s  independent  of  X  and  such  that  P[s  —  1]  = 
P[s  —  —1]  =  1/2.  Then  X\  —  X  and  X2  —  sX  are  both  distributed  according 
to  yK(0, 1).  Moreover,  E[X  1X2]  —  £[£]£[X2]  =  0.  Nonetheless  X\  and  X2  are 
obviously  not  independent  (because  |Xi  |  =  IX2I).  In  this  example,  (Xi,2G)  is  not  a 
Gaussian  vector  in  M2  despite  the  fact  that  both  coordinates  are  Gaussian  variables. 

Proof  Suppose  that  the  a-fields  a  (Hi)  are  independent.  Then,  if  i  ^  j,  if  X  g  Ht 
and  Y  g  Hj, 


E[XY]  =  E[X]E[Y]  =  0, 

so  that  the  linear  spaces  Ht  are  pairwise  orthogonal. 

Conversely,  suppose  that  the  linear  spaces  Ht  are  pairwise  orthogonal.  From  the 
definition  of  the  independence  of  an  infinite  collection  of  a-fields,  it  is  enough  to 
prove  that,  if  i\, . . . ,  ip  el  are  distinct,  the  a-fields  o(Hi{), . . . ,  a  (Hi)  are  indepen¬ 
dent.  To  this  end,  it  is  enough  to  verify  that,  if  , ... ,  e  Hix,  ...  ,£/,...,  g 
Hip  are  fixed,  the  vectors  (£{,...,  ),...,  (%l{, ... ,  ££  )  are  independent  (indeed, 

for  every  j  e  {1, . . .  ,p},  the  events  of  the  form  {f\  e  A\, . . . ,  g  An .}  give  a  class 
stable  under  finite  intersections  that  generates  the  a-field  a  (Hi),  and  the  desired 
result  follows  by  a  standard  monotone  class  argument,  see  Appendix  Al).  However, 
for  every  j  G  {1, ...  ,p}  we  can  find  an  orthonormal  basis  (r)[, ... ,  r]Jmj)  of  the  linear 

subspace  of  L 2  spanned  by  {£{,...,  £{  }.  The  covariance  matrix  of  the  vector 

(hi ,  •  •  •  ,  hm\  ’  hi  ’  •  •  •  ’  hm2  ’  ’  ’  ’  ’  hi  ’  •  •  •  ’  hmp ) 

is  then  the  identity  matrix  (for  i  j,  E[r)\r]l\  —  0  because  Hi  and  Hj  are  orthogonal). 
Moreover,  this  vector  is  Gaussian  because  its  components  belong  to  H.  By  Proposi¬ 
tion  1.2,  the  components  of  the  latter  vector  are  independent  random  variables.  This 
implies  in  turn  that  the  vectors  (rj\, ... ,  rjlm ),...,  (rj[, ... ,  r)^p)  are  independent. 

Equivalently,  the  vectors  (£|, . . . ,  ),...,  (£{\  ... ,  )  are  independent,  which  was 

the  desired  result.  □ 
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As  an  application  of  the  previous  theorem,  we  now  discuss  conditional  expecta¬ 
tions  in  the  Gaussian  framework.  Again,  the  fact  that  these  conditional  expectations 
can  be  computed  as  orthogonal  projections  (as  shown  in  the  next  corollary)  is  very 
particular  to  the  Gaussian  setting. 

Corollary  1.10  Let  H  be  a  ( centered )  Gaussian  space  and  let  K  be  a  closed  linear 
subspace  ofH.  Let  px  denote  the  orthogonal  projection  onto  K  in  the  Hilbert  space 
L2,  and  let  X  e  H. 

(i)  We  have 


E[X  |  g(K)]  =  pK(X). 


(ii)  Let  g2  —  E[(X  —  pk(X))2].  Then,  for  every  Borel  subset  A  o/R,  the  random 
variable  P[X  e  A  |  cr  (A)]  is  given  by 

P[XeA  |  o(K)\(a))  =  Q(co,A ), 

where  Q(oo,  •)  denotes  the  JP (px(X)(co),  o2) -distribution: 


Q(co,A)  =  — l—  [  d y  exp  (  - 
ay  2tt  Ja  v 


{y  -  pK{X){co)f 
2  o2 


) 


(and  by  convention  Q(co,A)  =  1a(pk(X))  ifo  —  0). 


Remarks 

(a)  Part  (ii)  of  the  statement  can  be  interpreted  by  saying  that  the  conditional 
distribution  of  X  knowing  o(K)  is  XP(pk{X),  g2). 

(b)  For  a  general  random  variable  A  in  L2,  one  has 

E[X  |  a ( K) ]  =  pL 2 (q  q- (K)  p)  (A) . 

Assertion  (i)  shows  that,  in  our  Gaussian  framework,  this  orthogonal  projection 
coincides  with  the  orthogonal  projection  onto  the  space  A,  which  is  “much 
smaller”  than  L2  (L2 ,  g  (K) ,  P) . 

(c)  Assertion  (i)  also  gives  the  principle  of  linear  regression.  For  instance,  if 
(Ai,  A2,  A3)  is  a  (centered)  Gaussian  vector  in  M3,  the  best  approximation  in 
L2  of  A3  as  a  (not  necessarily  linear)  function  of  X\  and  A2  can  be  written 
A1A1  +  A2A2  where  A 1  and  A2  are  computed  by  saying  that  A3  —  (A1A1  +  A2A2) 
is  orthogonal  to  the  vector  space  spanned  by  X\  and  A2. 
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Proof 

(i)  Let  Y  —  X  —  Pk(X).  Then  Y  is  orthogonal  to  K  and,  by  Theorem  1.9,  Y  is 
independent  of  o(K).  Then, 

E[X  |  o(K)\  =  E[pK(X)  \  a(K)}  +  E[Y  |  a(/t)]  =  pK(X)  +  E[Y ]  =  pK(X). 

(ii)  For  every  nonnegative  measurable  function/  on  R+, 

E[f(X)  |  rr(K)]  =  E[f(pK(X )  +  Y)  \  a(K)]  =  J  PY(dy)f(pK(X)  +  y), 

where  PY  is  the  law  of  T,  which  is  zE(0,  a2)  since  Y  is  centered  Gaussian  with 
variance  a2.  In  the  second  equality,  we  also  use  the  following  general  fact:  if 
Z  is  a  ^-measurable  random  variable  and  if  Y  is  independent  of  then,  for 
every  nonnegative  measurable  function  g ,  E[g(Y,Z )  |  Z?]  —  f  g(y,Z)  PY(dy). 
Property  (ii)  immediately  follows. 

□ 

Let  (Xt)teT  be  a  (centered)  Gaussian  process.  The  covariance  function  of  X  is 
the  function  r  :  T  x  T  — >  R  defined  by  r(s,t )  =  cov(X5,X?)  =  E[XsXt\.  This 
function  characterizes  the  collection  of  finite- dimensional  marginal  distributions  of 
the  process  X ,  that  is,  the  collection  consisting  for  every  choice  of  the  distinct  indices 
t\, . . . ,  tp  in  T  of  the  law  of  the  vector  ( Xtl , ,Xtp).  Indeed  this  vector  is  a  centered 
Gaussian  vector  in  W  with  covariance  matrix  (L(^,  (,•)) i<i, ./'</? • 

Remark  One  can  define  in  an  obvious  way  the  notion  of  a  non-centered  Gaussian 
process.  The  collection  of  finite-dimensional  marginal  distributions  is  then  charac¬ 
terized  by  the  covariance  function  and  the  mean  function  1 1->  m(t)  =  E[Xt\. 

Given  a  function  r  on  T  x  T,  one  may  ask  whether  there  exists  a  Gaussian 
process  X  whose  r  is  the  covariance  function.  The  function  r  must  be  symmetric 
( r(s,t )  =  r(t,s))  and  of  positive  type  in  the  following  sense:  if  c  is  a  real  function 
on  T  with  finite  support,  then 


c(s)c(t)  r(s,  t)  >  0. 

TxT 

Indeed,  if  r  is  the  covariance  function  of  the  process  X ,  we  have  immediately 

c(s)c(t)  r(s,  t)  =  var^  >  0. 

TxT  T 

Note  that  when  T  is  finite,  the  problem  of  the  existence  of  X  is  solved  under  the 
preceding  assumptions  on  r  by  Theorem  1.3. 

The  next  theorem  solves  the  existence  problem  in  the  general  case.  This  theorem 
is  a  direct  consequence  of  the  Kolmogorov  extension  theorem,  which  in  the 
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particular  case  T  —  M+  is  stated  as  Theorem  6.3  in  Chap.  6  below  (see  e.g. 
Neveu  [64,  Chapter  III],  or  Kallenberg  [47,  Chapter  VI]  for  the  general  case).  We 
omit  the  proof  as  this  result  will  not  be  used  in  the  sequel. 

Theorem  1.11  Let  T  be  a  symmetric  function  of  positive  type  on  T  x  T.  There 
exists,  on  an  appropriate  probability  space  (X2,  P),  a  centered  Gaussian  process 

whose  covariance  function  is  T. 

Example  Consider  the  case  T  —  R  and  let  g  be  a  finite  measure  on  R,  which  is 
also  symmetric  (i.e.  /x(— A)  =  /z(A)).  Then  set,  for  every  s,  t  e  R, 


ei£  (t-s) 


It  is  easy  to  verify  that  T  has  the  required  properties.  In  particular,  if  c  is  a  real 
function  on  R  with  finite  support, 


E  c(s)c(r)  r(s,  t) 

MxR 


/z(d£)  >  0. 


The  process  T  enjoys  the  additional  property  that  T(s,  t)  only  depends  on  t  —  s.  It 
immediately  follows  that  any  (centered)  Gaussian  process  (Xt)teR  with  covariance 
function  T  is  stationary  (in  a  strong  sense),  meaning  that 


(d) 


~\~t ?  ’  ’  ’  ’  Xtn+t)  =  (Xtl,Xt2,. 


for  any  choice  of  t\, ...  ,tn,t  e  R.  Conversely,  any  stationary  Gaussian  process 
X  indexed  by  R  has  a  covariance  of  the  preceding  type  (this  is  Bodmer’ s  theorem, 
which  we  will  not  use  in  this  book),  and  the  measure  p  is  called  the  spectral  measure 
of  X. 


1.4  Gaussian  White  Noise 

Definition  1.12  Let  (  /:  .  S’)  be  a  measurable  space,  and  let  ji  be  a  a-finite  measure 
on  ( E ,  S).  A  Gaussian  white  noise  with  intensity  ji  is  an  isometry  G  from 
L2(E,  (o  ,  p)  into  a  (centered)  Gaussian  space. 

Hence,  iff  e  L2(E,  (o,  /z),  G(f)  is  centered  Gaussian  with  variance 
E[G{f)2]  =  \\G(f)\\2LWp)  =  \\f\\2LHM  =  f  f2  An. 
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Iff,  g  £  L2(E,  S,  /x),  the  covariance  of  G(/)  and  G(g)  is 

E[G(f)G(g)\  =  </,  g)L2(M  =  jfg  d/i. 

In  particular,  iff  =  1a  with  /x(A)  <  oo,  G(1a)  is  c/L(0, /x (A)) -distributed.  To 
simplify  notation,  we  will  write  G(A)  —  G(1a). 

Let  A\, ...  ,An  £  S  be  disjoint  and  such  that  /x(A;)  <  oo  for  every  j.  Then  the 
vector 


(G(A0,...,G(An)) 

is  a  Gaussian  vector  in  W1  and  its  covariance  matrix  is  diagonal  since,  if  i  ^  j , 

£[G(A,-)G(A,-)]  =  (1A,  1Aj)l2(M  =  0. 

From  Proposition  1.2,  we  get  that  the  variables  G(Ai), . . . ,  G(An )  are  independent. 

Suppose  that  A  £  ^  is  such  that  /x(A)  <  oo  and  that  A  is  the  disjoint  union  of 
a  countable  collection  A i,  A2, ...  of  measurable  subsets  of  E.  Then,  1a  —  1a7- 

where  the  series  converges  in  L2(£,  /x),  and  by  the  isometry  property  this  implies 

that 


00 

G(A)  =  J2  G(Aj) 

7=1 

where  the  series  converges  in  L2(^2,^,P)  (since  the  random  variables  G(Aj) 
are  independent,  an  easy  application  of  the  convergence  theorem  for  discrete 
martingales  also  shows  that  the  series  converges  a.s.). 

Properties  of  the  mapping  A  1-^  G(A)  are  therefore  very  similar  to  those  of  a 
measure  depending  on  the  parameter  co  £  £2.  However,  one  can  show  that,  if  co  is 
fixed,  the  mapping  A  1-^  G(A)(<x>)  does  not  (in  general)  define  a  measure.  We  will 
come  back  to  this  point  later. 

Proposition  1.13  Let  ( E ,  S)  be  a  measurable  space,  and  let  gibe  a  o -finite  measure 
on  (E,  S).  There  exists,  on  an  appropriate  probability  space  (T2 ,  AP ,  P),  a  Gaussian 
white  noise  with  intensity  /x. 

Proof  We  rely  on  elementary  Hilbert  space  theory.  Let  (fi,  i  £  I)  be  a  total 
orthonormal  system  in  the  Hilbert  space  L2(E,  S,  /x).  For  every/  £  L2(E,  S,  p), 

f  =  E  “'X 

iel 
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where  the  coefficients  oti  —  (f,f)  are  such  that 


iel 


/II2  <  00. 


On  an  appropriate  probability  space  (£2,  JP,P)  we  can  construct  a  collection 
indexed  by  the  same  index  set  /,  of  independent  ^E(0, 1)  random  variables  (see  [64, 
Chapter  III]  for  the  existence  of  such  a  collection  -  in  the  sequel  we  will  only  need 
the  case  when  I  is  countable,  and  then  an  elementary  construction  using  only  the 
existence  of  Lebesgue  measure  on  [0, 1]  is  possible),  and  we  set 


G(f)  =  J2°“X- 

iel 


The  series  converges  in  L 2  since  the  X;,  i  e  /,  form  an  orthonormal  system  in 
L 2 .  Then  clearly  G  takes  values  in  the  Gaussian  space  generated  by  X;,  i  e  /. 
Furthermore,  G  is  an  isometry  since  it  maps  the  orthonormal  basis  (f,  i  e  I)  to 
an  orthonormal  system.  □ 

We  could  also  have  deduced  the  previous  result  from  Theorem  1.11  applied  with 
T  =  and  r(f,g)  =  (f,g)L 2(£><r  In  this  way  we  get  a  Gaussian 

process  ( Xf,f  e  L2(E,  g ,  /i))  and  we  just  have  to  take  G(f)  —  Xf. 

Remark  In  what  follows,  we  will  only  consider  the  case  when  L2(E,(o,ii)  is 
separable.  For  instance,  if  (E,  S’)  —  (R+,  ^(R+))  and  /z  is  Lebesgue  measure,  the 
construction  of  G  only  requires  a  sequence  (t;n)n> o  of  independent  yK(0,  1)  random 
variables,  and  the  choice  of  an  orthonormal  basis  {(pn)n> o  of  L2(M+,  «^(R+),  dt): 
We  get  G  by  setting 


Gif)  = 

n>  0 


See  Exercise  1.18  for  an  explicit  choice  of  (ipn)n> o  when  E  —  [0,1]. 

Our  last  proposition  gives  a  way  of  recovering  the  intensity  /z(A)  of  a  measurable 
set  A  from  the  values  of  G  on  the  atoms  of  finer  and  finer  partitions  of  A. 

Proposition  1.14  Let  G  be  a  Gaussian  white  noise  on  ( E ,  <§)  with  intensity  /z.  Let 
A  e  S’  be  such  that  /z(A)  <  oo.  Assume  that  there  exists  a  sequence  of  partitions 
of  A, 


A  =  A"  U  . . .  U  Ankn 
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whose  “mesh  ”  tends  to  0,  in  the  sense  that 


lim  [  sup  /x(A")  1=0. 


n—>oo 


1</<V 


Then, 


lim  E  G(A'j)1 2 3  =  ,UA) 


n—>o o 


7=1 


/ft  L2. 

Proof  For  every  fixed  ft,  the  variables  G(A"), . . . ,  G(Ank)  are  independent.  Further¬ 
more,  £[G(A")2]  =  /x(A").  We  then  compute 


E  G(A")2  -  ^A> 


7=1 


=  E  var(G(A”)2)  =  2j^^i(AJ)2, 


7=1 


7=1 


because,  if  X  is  ^F(0,  a2),  var(X2)  =  £(X4)  —  a4  =  3a4  —  a4  =  2a4.  Then 


4  _  q^-4  ^4  _ 


X]/x(aJ)2^(  sup  m(a")j/4a) 


7=1 


1  <7<V 


tends  to  0  as  n  — >  oo  by  assumption. 


□ 


Exercises 

Exercise  1.15  Let  (Xf)fG[o,i]  be  a  centered  Gaussian  process.  We  assume  that  the 
mapping  ( t ,  &>)  i->  X^u;)  from  [0, 1]  x  T2  into  R  is  measurable.  We  denote  the 
covariance  function  of  X  by  K. 

1.  Show  that  the  mapping  t  i->  Xt  from  [0,  1]  into  L2(T2)  is  continuous  if  and  only  if 
K  is  continuous  on  [0,  l]2.  In  what  follows,  we  assume  that  this  condition  holds. 

2.  Let  h  :  [0, 1]  ->  R  be  a  measurable  function  such  that  /J  \h(t)\y/K(t,  t )  dt  <  oo. 
Show  that,  for  a.e.  &>,  the  integral  /0'  /z(t)X/(u;)dt  is  absolutely  convergent.  We  set 
z=  /;'  h(t)X,dt. 

3.  We  now  make  the  stronger  assumption  \h(t)\dt  <  oo.  Show  that  Z  is  the  L 2- 

x 

limit  of  the  variables  Zn  =  J^”=1  Xx  [/!_  1  h(t)dt  when  n  — >  oo  and  infer  that  Z  is 

n  /; 

a  Gaussian  random  variable. 
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4.  We  assume  that  K  is  twice  continuously  differentiable.  Show  that,  for  every  t  e 
[0,  1],  the  limit 


s  —  t 


_  • 

exists  in  L2(Y2).  Verify  that  (W)re[o,i]  is  a  centered  Gaussian  process  and  compute 
its  covariance  function. 

Exercise  1.16  ( Kalman  filtering)  Let  (en)n>o  and  {r)n)n> o  be  two  independent 
sequences  of  independent  Gaussian  random  variables  such  that,  for  every  n ,  en  is 
distributed  according  to  yK(0,a2)  and  rjn  is  distributed  according  to  ^L(0,52), 
where  a  >  0  and  S  >  0.  We  consider  two  other  sequences  ( Xn)n>o  and  (Yn)n> o 
defined  by  the  properties  Xo  —  0,  and,  for  every  n  >  0,  Xn+\  —  anXn  +  and 
Yn  —  cXn  +  r)n,  where  c  and  an  are  positive  constants.  We  set 


Xn/n  —  E[Xn  |  Yq,  Y\, . . . ,  Yn], 

Xn-\-\/n  —  E[Xn-\-\  |  Yq,  Y\ , . . . ,  Yfi\. 

The  goal  of  the  exercise  is  to  find  a  recursive  formula  allowing  one  to  compute  these 
conditional  expectations. 

1.  Verify  that  Xn+\/n  —  anXn/n ,  for  every  n  >  0. 

2.  Show  that,  for  every  n  >  1 , 


Xnfn  —  Xn/n—  1  T 


where  Zn  . —  Yn  cXn/n—\. 

3.  Evaluate  E[XnZn\  and  E[Z2\  in  terms  of  Pn  E[(Xn  —  Xn/n~\)2]  and  infer  that, 
for  every  n  >  1 , 


X 


n+\/n 


—  a 


n 


Xn/n-1  + 


cPn 

C2P  n  +  52 


4.  Verify  that  P\  —  o2  and  that,  for  every  n  >  1 ,  the  following  induction  formula 
holds: 


^n+l  =  O2  +  a2n 


82Pn 

C2Pn  +  82  ' 


Exercise  1.17  Let  H  be  a  (centered)  Gaussian  space  and  let  H\  and  H2  be  linear 
subspaces  of  H.  Let  K  be  a  closed  linear  subspace  of  H.  We  write  pK  for  the 
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orthogonal  projection  onto  K.  Show  that  the  condition 


W,  G  H\ ,  WX2  G  H2 ,  E[XxX2\  =  £[M*i)M*2)] 


implies  that  the  cr-fields  cr(H\)  and  g(H2)  are  conditionally  independent  given 
cr(X).  (This  means  that,  for  every  nonnegative  cr  (/A) -measurable  random  variable 
Xx,  and  for  every  nonnegative  a  (H2) -measurable  random  variable  X2,  one  has 
E[XxX2\g(K)]  =  E[Xx\g(K)\E[X2\g(K)\.)  Hint :  Via  monotone  class  arguments 
explained  in  Appendix  Al,  it  is  enough  to  consider  the  case  where  X\ ,  resp.  X2, 
is  the  indicator  function  of  an  event  depending  only  on  finitely  many  variables  in 
H i,  resp.  in  H2. 

Exercise  1.18  ( Levy's  construction  of  Brownian  motion)  For  every  t  G  [0, 1],  we 
set  ho(t)  —  1,  and  then,  for  every  integer  n  >  0  and  every  k  G  {0, 1 , . . . ,  2n  —  1}, 


~  l[(2k)2~n~l 


,{2k-\r\)2~n~v)  (0  —  l[(2/:+l)2— ,7— 1  ,(2/:+2)2— ,I— 1 )  (0  • 


1.  Verify  that  the  functions  ho,  (^)w>i,o<£<2«-i  form  an  orthonormal  basis  of 
L2([0,  1],  ^([0,  1]),  d t).  (Hint:  Observe  that,  for  every  fixed  n  >  0,  any  function 
/  :  [0, 1)  ->  R  that  is  constant  on  every  interval  of  the  form  [(j  —  1)2  n,j2  n), 
for  1  <  j  <  2n,  is  a  linear  combination  of  the  functions  ho,  ( hf)o<m<n,o<k<2m-i  •) 

2.  Suppose  that  No,  (N%)n>i,o<k<2n-i  are  independent  nK(0,  1)  random  variables. 
Justify  the  existence  of  the  (unique)  Gaussian  white  noise  G  on  [0, 1],  with 
intensity  d t,  such  that  G(ho)  —  No  and  G(hnk )  =  for  every  n  >  0  and 
0  <  k  <  2n  -  1. 

3.  For  every  t  G  [0, 1],  set  Bt  —  G([0,  t]).  Verify  that 

oo  2"— 1 

=  tiv0  +  £  ( V)> 

n= 0  k= 0 

where  the  series  converges  in  L2,  and  the  functions  gnk  :  [0,1]  ->  [0,  oo)  are  given 
by 


gnk(t)=  fhl(s)  ds. 

Jo 

Note  that  the  functions  gnk  are  continuous  and  satisfy  the  following  property:  For 
every  fixed  n  >  0,  the  functions  gnk,  0  <  k  <  2n  —  1,  have  disjoint  supports  and 
are  bounded  above  by  2~n /2. 

4.  For  every  integer  m  >  0  and  every  t  G  [0, 1]  set 

m- 1  2"-l 

Bf(m)  =  ^0  +  y](y]^(o^)- 

«=()  k= 0 
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of  the  figure,  lines  become  thinner  when  m  increases.  The  lengths  of  the  dashed  segments  are 
determined  by  the  values  of  No  and  N™  for  m  =  0,  1,2 


See  Fig.  1.1  for  an  illustration.  Verify  that  the  continuous  functions  t  i->  B{r\co) 
converge  uniformly  on  [0,  1]  as  m  ->  oo,  for  a. a.  co.  (Hint:  If  N  is  «yK(0, 1)- 
distributed,  prove  the  bound  P( \ N\  >  a)  <  o~a  /2  for  a  >  1 ,  and  use  this  estimate 
to  bound  the  probability  of  the  event  {sup{|A^|  :  0  <  k  <  2n  —  1}  >  2”/4},  for 
every  fixed  n  >  0.) 

5.  Conclude  that  we  can,  for  every  t  >  0,  select  a  random  variable  B\  which  is  a.s. 
equal  to  Bu  in  such  a  way  that  the  mapping  t  i-^  B't(co)  is  continuous  for  every 
CO  G 


Notes  and  Comments 

The  material  in  this  chapter  is  standard.  We  refer  to  Adler  [1]  and  Lifshits  [55]  for 
more  information  about  Gaussian  processes.  The  more  recent  book  [56]  by  Marcus 
and  Rosen  develops  striking  applications  of  the  known  results  about  Gaussian 
processes  to  Markov  processes  and  their  local  times.  Exercise  1 . 16  involves  a  simple 
particular  case  of  the  famous  Kalman  filter,  which  has  numerous  applications  in 
technology.  See  [49]  or  [62]  for  the  details  of  the  construction  in  Exercise  1.18. 


Chapter  2 

Brownian  Motion 


In  this  chapter,  we  construct  Brownian  motion  and  investigate  some  of  its  properties. 
We  start  by  introducing  the  “ ‘pre- Brownian  motion”  (this  is  not  a  canonical 
terminology),  which  is  easily  defined  from  a  Gaussian  white  noise  on  M+  whose 
intensity  is  Lebesgue  measure.  Going  from  pre-Brownian  motion  to  Brownian 
motion  requires  the  additional  property  of  continuity  of  sample  paths,  which  is 
derived  here  via  the  classical  Kolmogorov  lemma.  The  end  of  the  chapter  discusses 
several  properties  of  Brownian  sample  paths,  and  establishes  the  strong  Markov 
property,  with  its  classical  application  to  the  reflection  principle. 


2.1  Pre-Brownian  Motion 

Throughout  this  chapter,  we  argue  on  a  probability  space  (X2,  &,P).  Most  of  the 
time,  but  not  always,  random  processes  will  be  indexed  by  T  —  M+  and  take  values 
in  R. 

Definition  2.1  Let  G  be  a  Gaussian  white  noise  on  M+  whose  intensity  is  Lebesgue 
measure.  The  random  process  (Bt)te defined  by 


Bt  —  G(l[0,r]) 


is  called  pre-Brownian  motion . 

Proposition  2.2  Pre-Brownian  motion  is  a  centered  Gaussian  process  with  covari¬ 
ance 


K(s,  t)  —  min^,  t} 


(not.) 


S  At. 
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Proof  By  the  definition  of  a  Gaussian  white  noise,  the  variables  Bt  belong  to  a 
common  Gaussian  space,  and  therefore  (Bt)t> o  is  a  Gaussian  process.  Moreover,  for 
every  s,t  >  0, 


E[BSB, ]  =  £[G([0,  s])G([0,  t])} 


drl[o,s](r)l[0,,](r)  =  s  At. 


□ 

The  next  proposition  gives  different  ways  of  characterizing  pre-Brownian 
motion. 

Proposition  2.3  Let  ( Xt)t>o  be  a  ( real-valued )  random  process.  The  following 
properties  are  equivalent: 

(i)  (Xt)t> o  is  a  pre-Brownian  motion; 

(ii)  (Xt)t>o  is  a  centered  Gaussian  process  with  covariance  K(s,t )  =  s  A  t; 

(iii)  Xq  =  0  a.s.,  and,  for  every  0  <  s  <  t,  the  random  variable  Xt  —  Xs  is 
independent  ofa(Xr,  r  <  s)  and  distributed  according  to  <sK(0,  t  —  s); 

(iv)  Xq  =  0  a.s.,  and,  for  every  choice  of  0  =  to  <  t\  <  •  •  •  <  tp,  the  variables 
Xti-xti_v  1  <i<  p  are  independent,  and,  for  every  1  <  i  <  p,  the  variable 
Xti  —  Xti_x  is  distributed  according  to  AL(0,  U  —  U-\). 

Proof  The  fact  that  (i)=>(ii)  is  Proposition  2.2.  Let  us  show  that  (ii)=^(iii).  We 
assume  that  (X,/>0  is  a  centered  Gaussian  process  with  covariance  K(s,  t)  =  s  A  t , 
and  we  let  H  be  the  Gaussian  space  generated  by  (X^>0.  Then  Xo  is  distributed 
according  to  ,/L(0, 0)  and  therefore  X0  =  0  a.s.  Then,  fix  s  >  0  and  write  Hs  for  the 
vector  space  spanned  by  (Xr,  0  <  r  <  .s1),  and  Hs  for  the  vector  space  spanned  by 
( Xs+U  —  Xs ,  u  >  0).  Then  Hs  and  Hs  are  orthogonal  since,  for  r  e  [0,  s\  and  u  >  0, 

E[Xr(Xs+u  -  X5)]  —  r  A  (s  +  u)  —  r  A  s  —  r  —  r  —  0. 

Noting  that  Hs  and  Hs  are  subspaces  of  H ,  we  deduce  from  Theorem  1.9  that  o(Hs) 
and  o(Hs)  are  independent.  In  particular,  if  we  fix  t  >  s ,  the  variable  Xt  —  Xs  is 
independent  of  o(Hs )  =  a(Xr,r  <  51).  Finally,  using  the  form  of  the  covariance 
function,  we  immediately  get  that  Xt  —  Xs  is  distributed  according  to  aK(0,  t  —  s). 

The  implication  (iii)^(iv)  is  straightforward.  Taking  s  =  tp~ \  and  t  =  tp  we 
obtain  that  Xtp  —  Xtp_x  is  independent  of  (Xh , ... ,  Xt  f).  Similarly,  Xtp_x  —  Xtp_2  is 
independent  of  {Xtx , ... ,  Xt  2),  and  so  on.  This  implies  that  the  variables  Xtj  —Xtj_x , 
1  <  iS  p,  are  independent. 

Let  us  show  that  (iv)=^(i).  It  easily  follows  from  (iv)  that  X  is  a  centered  Gaussian 
process.  Then,  if/  is  a  step  function  on  M+  of  the  form /  =  i  ^  Mu-iMb  where 
0  =  to  <  t\  <  t2  <  •  •  •  <  tp,  we  set 


G(/)  =  E 

i=  1 
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(note  that  this  definition  of  G(f)  depends  only  on  /  and  not  on  the  particular  way 
we  have  written/  =  YH=i  Suppose  then  that  /  and  g  are  two  step 

functions.  We  can  write/  =  A,- l(/,._lA]  and  g  =  £"=1  1  (r,- , ./,]  with  the 

same  subdivision  0  =  to  <  h  <  L  <  •  •  •  <  tp  for /  and  for  g  (just  take  the  union  of 
the  subdivisions  arising  in  the  expressions  off  and  g).  It  then  follows  from  a  simple 
calculation  that 


E[G(f)G(g)]  =  [  f(t)g(t )  dr, 

J  R+ 

so  that  G  is  an  isometry  from  the  vector  space  of  step  functions  on  R+  into  the 
Gaussian  space  H  generated  by  X.  Using  the  fact  that  step  functions  are  dense 
in  L2(M+,  ^(R_|_),  dt),  we  get  that  the  mapping/  i->  G(/)  can  be  extended  to 
an  isometry  from  L2(M+,  <^(M+),  d t)  into  the  Gaussian  space  H.  Finally,  we  have 
G([0,  t ])  =  Xt  —  Xo  —  Xt  by  construction.  □ 

Remark  The  variant  of  (iii)  where  the  law  of  Xt  —  Xs  is  not  specified  but  required 
to  only  depend  on  t  —  s  is  called  the  property  of  stationarity  (or  homogeneity)  and 
independence  of  increments.  Pre-Brownian  motion  is  thus  a  special  case  of  the  class 
of  processes  with  stationary  independent  increments  (under  an  additional  regularity 
assumption,  these  processes  are  also  called  Levy  processes,  see  Sect.  6.5.2). 

Corollary  2.4  Let  (Bt)t> o  be  a  pre-Brownian  motion.  Then,  for  every  choice  of  0  = 
to  <  t\  <  •  •  •  <  tn,  the  law  of  the  vector  ( Btl ,  Bt2, ,  Btn )  has  density 


1 

(2jr)n/2  yj\  (t2  -  ~t\) . . .  JTn  -  r„_i) 


(Xj  -X,-l)2\ 

2  fa  -  ti- 1)  / 


where  by  convention  xo  —  0. 

Proof  The  random  variables  Btl  ,Bt2  —  Btl, ... ,  Btn  —  Btn_x  are  independent  with 
respective  distributions  q),  ^  —  t\), ...,  XT (f),tn  —  tn~ i).  Hence  the 

vector  (j Btl ,  Btl  -  Btx , . . . ,  Btn  -  Btn_x)  has  density 


n 


q(yu...,y„)  = 


(2jt)n/2  yft Vjl  -  h)  •  •  •  (tn  -  t„-l) 


exp 


(-E 


Yi 


i=i  u~ 


i))’ 


and  the  change  of  variables  xt  —  y\  +  •  •  •  +  yt  for  i  e  { 1 , ...  ,n}  completes  the 
argument.  Alternatively  we  could  have  used  Theorem  1.3  (ii).  □ 


Remark  Corollary  2.4,  together  with  the  property  Z?o  =  0,  determines  the  collec¬ 
tion  of  finite -dimensional  marginal  distributions  of  pre-Brownian  motion.  Property 
(iv)  of  Proposition  2.3  shows  that  a  process  having  the  same  finite-dimensional 
marginal  distributions  as  pre-Brownian  motion  must  also  be  a  pre-Brownian  motion. 
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Proposition  2.5  Let  B  be  a  pre-Brownian  motion.  Then , 

(i)  —B  is  also  a  pre-Brownian  motion  ( symmetry  property ); 

1  i 

(ii)  for  every  A  >  0,  the  process  B y  =  yByif  is  also  a  pre-Brownian  motion 
( invariance  under  scaling ); 

(iii)  for  every  s  >  0,  the  process  B)  —  Bs+t  —  Bs  is  a  pre-Brownian  motion  and  is 
independent  of  a  (Br,  r  <  s)  ( simple  Markov  property ). 

Proof  (i)  and  (ii)  are  very  easy.  Let  us  prove  (iii).  With  the  notation  of  the  proof 
of  Proposition  2.3,  the  a-field  generated  by  is  o(Hs ),  which  is  independent  of 

g(Hs)  —  <r(Br,  r  <  s).  To  see  that  is  a  pre-Brownian  motion,  it  is  enough  to 
verify  property  (iv)  of  Proposition  2.3,  which  is  immediate  since  B>\f  —  b\^_{  — 

Bs+ti  Bs-\-ti—\  •  L 

Let  B  be  a  pre-Brownian  motion  and  let  G  be  the  associated  Gaussian  white  noise. 
Note  that  G  is  determined  by  B:  If  /  is  a  step  function  there  is  an  explicit  formula 
for  G(/)  in  terms  of  B ,  and  one  then  uses  a  density  argument.  One  often  writes  for 

/€L2(M+,^(R+),dO, 


G(f)  =  /  f(s)  d  Bs 

Jo 

and  similarly 

l'f(s)dBs  ,  G(/1M)=  f'f(r)  dBr  . 

Jo  Js 

This  notation  is  justified  by  the  fact  that,  if  u  <  v, 

d Bs  =  G((u,  v])  =  G([ 0,  v])  -  G([ 0,  u])  =  Bv  -  Bu. 

The  mapping/  i-^  f(s)  d Bs  (that  is,  the  Gaussian  white  noise  G)  is  then  called 
the  Wiener  integral  with  respect  to  B.  Recall  that  f^°f(s)dBs  is  distributed  according 
to  ,jV  (0,  f^°  f(s)2ds). 

Since  a  Gaussian  white  noise  is  not  a  “real”  measure  depending  on  co ,  f^°f(s)dBs 
is  not  a  “real”  integral  depending  on  co.  Much  of  what  follows  in  this  book  is  devoted 
to  extending  the  definition  of  f^°f(s)dBs  to  functions/  that  may  depend  on  co. 


2.2  The  Continuity  of  Sample  Paths 

We  start  with  a  general  definition.  Let  E  be  a  metric  space  equipped  with  its  Borel 
a-field. 


2.2  The  Continuity  of  Sample  Paths 
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Definition  2.6  Let  ( Xt)teT  be  a  random  process  with  values  in  E.  The  sample  paths 
of  A  are  the  mappings  T  3  t  i->  Xfico)  obtained  when  fixing  co  e  £2 .  The  sample 
paths  of  X  thus  form  a  collection  of  mappings  from  T  into  E  indexed  by  co  e  £2 . 

Let  B  —  (Bt)t> o  be  a  pre-Brownian  motion.  At  the  present  stage,  we  have  no 
information  about  the  sample  paths  of  B.  We  cannot  even  assert  that  these  sample 
paths  are  measurable  functions.  In  this  section,  we  will  show  that,  at  the  cost  of 
“slightly”  modifying  B ,  we  can  ensure  that  sample  paths  are  continuous. 

Definition  2.7  Let  ( Xt)teT  and  ( Xt)teT  be  two  random  processes  indexed  by  the 
same  index  set  T  and  with  values  in  the  same  metric  space  E.  We  say  that  A  is  a 
modification  of  X  if 


VteT,  P(Xt  =  Xt)  =  1. 

This  implies  in  particular  that  X  has  the  same  finite-dimensional  marginals  as  X. 
Thus,  if  A  is  a  pre-Brownian  motion,  A  is  also  a  pre-Brownian  motion.  On  the  other 
hand,  sample  paths  of  A  may  have  very  different  properties  from  those  of  A.  For 
instance,  considering  the  case  where  T  =  M+  and  E  =  R,  it  is  easy  to  construct 
examples  where  all  sample  paths  of  A  are  continuous  whereas  all  sample  paths  of  A 
are  discontinuous. 

Definition  2.8  The  process  A  is  said  to  be  indistinguishable  from  A  if  there  exists 
a  negligible  subset  N  of  £2  such  that 

Xco  e  Q\N,  Wt  e  T,  X t(co)  =  X t(co). 

Put  in  a  different  way,  A  is  indistinguishable  from  A  if 


PiXteT ,  Xt  —  Xt)  —  1. 


(This  formulation  is  slightly  incorrect  since  the  set  {Vf  e  T,  Xt  =  Xt)  need  not  be 
measurable.) 

If  A  is  indistinguishable  from  A  then  A  is  a  modification  of  A.  The  notion 
of  indistinguishability  is  however  much  stronger:  Two  indistinguishable  processes 
have  a.s.  the  same  sample  paths.  In  what  follows,  we  will  always  identify  two 
indistinguishable  processes.  An  assertion  such  as  “there  exists  a  unique  process  such 
that ...”  should  always  be  understood  “up  to  indistinguishability”,  even  if  this  is  not 
stated  explicitly. 

The  following  observation  will  play  an  important  role.  Suppose  that  T  —  I  is  an 
interval  of  R.  If  the  sample  paths  of  both  A  and  A  are  continuous  (except  possibly 
on  a  negligible  subset  of  X2),  then  A  is  a  modification  of  A  if  and  only  if  A  is 
indistinguishable  from  A.  Indeed,  if  A  is  a  modification  of  A  we  have  a.s.  A*  =  Xt 
for  every  tel  D  Q  (we  throw  out  a  countable  union  of  negligible  sets)  hence  a.s. 
Xt  —  Xt  for  every  tel,  by  a  continuity  argument.  We  get  the  same  result  if  we  only 
assume  that  the  sample  paths  are  right-continuous,  or  left-continuous. 
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Theorem  2.9  (Kolmogorov’s  lemma)  Let  X  —  {Xt)tei  be  a  random  process 
indexed  by  a  bounded  interval  I  ofR,  and  taking  values  in  a  complete  metric  space 
( E ,  d).  Assume  that  there  exist  three  reals  q,  s,C  >  0  such  that ,  for  every  s,t  E  I, 


E[d(Xs,Xt)q]  <  C\t-s 


1  +  £ 


Then,  there  is  a  modification  X  of  X  whose  sample  paths  are  Holder  continuous 
with  exponent  a  for  every  a  E  (0,  |):  This  means  that,  for  every  co  E  Q  and  every 
a  E  (0,  |),  a  finite  constant  Ca(co )  such  that,  for  every  s,t  E  I, 

d{Xs(co),Xt{co))  <  c„((»)|r-jr. 


In  particular,  X  is  a  modification  of  X  with  continuous  sample  paths  (by  the 
preceding  observations  such  a  modification  is  unique  up  to  indistinguishability). 


Remarks 

(i)  If  I  is  unbounded,  for  instance  if  I  —  M+,  we  may  still  apply  Theorem  2.9 

successively  with  /  =  [0,  1],  [1, 2],  [2,  3],  etc.  and  we  get  that  X  has  a 

modification  whose  sample  paths  are  locally  Holder  with  exponent  a  for  every 
a  E  (0 ,s/q). 

(ii)  It  is  enough  to  prove  that,  for  every  fixed  a  E  (0,  e/q),  X  has  a  modification 
whose  sample  paths  are  Holder  with  exponent  a.  Indeed,  we  can  then  apply 
this  result  to  every  choice  of  a  in  a  sequence  t  s/q,  noting  that  the  resulting 
modifications  are  indistinguishable,  by  the  observations  preceding  the  theorem. 

Proof  To  simplify  the  presentation,  we  take  I  —  [0,1],  but  the  proof  would  be  the 

same  for  any  bounded  interval  (closed  or  not).  We  fix  a  e  (0,  £). 

The  assumption  of  the  theorem  implies  that,  for  a  >  0  and  s,t  E  /, 


P(d(Xs,Xt)  >  a)  <  a~qE[d(Xs,Xt)q]  <  C a~q 


l  +  £ 


We  apply  this  inequality  io  s  —  (i  —  1)2  n,  t  —  i2  n  (for  i  E  {l, ...  ,2n})  and 
a  =  2~na: 


i- 1)2 


— n 


,Xa-n)  >  2~'fi  <  C2nqa2-{l+e)n. 


By  summing  over  i  we  get 


P\\J{d{X{i-l)2-n,Xi2-n)  >  <  2"  •  C2nqa-(l+B)n  =  C2~n(e 


— n(e—qa ) 
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By  assumption,  s  —  qa  >  0.  Summing  now  over  n ,  we  obtain 


oo  /  2” 


U {d(X(i-m-n,Xi2-n)  >  2~na)  <  OO, 


n=  1  \  i=  1 


and  the  Borel-Cantelli  lemma  implies  that,  with  probability  one,  we  can  find  a  finite 
integer  no((o)  such  that 


Consequently  the  constant  Ka  ( co )  defined  by 


is  finite  a.s.  (If  n  >  no(co),  the  supremum  inside  the  parentheses  is  bounded  above 
by  1,  and,  on  the  other  hand,  there  are  only  finitely  many  terms  before  no(co).) 

At  this  point,  we  use  an  elementary  analytic  lemma,  whose  proof  is  postponed 
until  after  the  end  of  the  proof  of  Theorem  2.9.  We  write  D  for  the  set  of  all  reals  of 
[0, 1)  of  the  form  il~n  for  some  integer  n  >  1  and  some  i  e  { 0, 1, . . . ,  2"  —  1}. 

Lemma  2.10  Let  f  be  a  mapping  defined  on  D  and  with  values  in  the  metric  space 
(. E ,  d).  Assume  that  there  exists  a  real  a  >  0  and  a  constant  K  <  oo  such  that,  for 
every  integer  n  >  1  and  every  i  G  {1, 2, . . . ,  2n  —  1}, 


1)2-*), f(i2~n))  <  K2~ 


Then  we  have,  for  every  s,t  e  D, 


t  _2_K 


We  immediately  get  from  the  lemma  and  the  definition  of  Ka(co)  that,  on  the 
event  {Ka(co)  <  oo}  (which  has  probability  1),  we  have,  for  every  s,t  e  D, 

d(Xs,Xt)  <  Ca(co)\t-s\a, 

where  Ca(co)  =  2(1  —  2 ~a)~lKa(co).  Consequently,  on  the  event  {Ka(co)  <  oo}, 
the  mapping  t  \->  Xt(co)  is  Holder  continuous  on  D,  hence  uniformly  continuous 
on  D.  Since  ( E ,  d)  is  complete,  this  mapping  has  a  unique  continuous  extension  to 
I  —  [0,1],  which  is  also  Holder  with  exponent  a.  We  can  thus  set,  for  every  t  e  [0,1] 
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where  xo  is  a  point  of  E  which  can  be  fixed  arbitrarily.  Clearly  Xt  is  a  random 
variable. 

By  the  previous  remarks,  the  sample  paths  of  the  process  X  are  Holder  with 
exponent  a  on  [0,  1].  We  still  need  to  verify  that  X  is  a  modification  of  X.  To  this 
end,  fix  t  G  [0,  1].  The  assumption  of  the  theorem  implies  that 

lim  X,  =  X, 

s — 

in  probability.  Since  by  definition  Xt  is  the  almost  sure  limit  of  Xs  when  s  ->  t, 
s  e  D,  we  conclude  that  Xt  =  Xt  a.s.  □ 

Proof  of  Lemma  2.10  Fix  s,t  e  D  with  s  <  t.  Let  p  >  1  be  the  smallest  integer 
such  that  2~p  <  t—s,  and  let  k  >  0  be  the  smallest  integer  such  that  k2~p  >  .s’.  Then, 
we  may  write 


s  —  k2  p  —  s\2  p  1  —  ...  —  Si2  p  1 
t  —  k2  p  +  £q2  p  +  £^2  p  1  +  . . .  +  s'm2  p  m , 

where  /,  m  are  nonnegative  integers  and  ej  =  0  or  1  for  every  1  <  i  <  l  and 
0  <  j  <  m.  Set 

s;  =  k2_/?  —  2— 77—1  —  ...  —  s{2~p~l  for  every  0  <  i  <  /, 

tj  —  k2~p  +  £q2_/?  +  +  . . .  +  £j2~p-j  for  every  0  <j<m. 


Then,  noting  that  s  =  si,t  =  tm  and  that  we  can  apply  the  assumption  of  the  lemma 
to  each  of  the  pairs  (sq,  *o),  (si- 1 ,  £/)  (for  1  <  i  <  /)  and  (tj- \ ,  /y)  (for  i  <j  <  m), 
we  get 


d(f(s),f(t ))  =  d(f(s,),f(tm )) 


m 


<  d(f(s0)J(t0 ))  +  y^rf(/(5«-i),/(j«))  +  'Yjdifitj- i), /(*,-)) 

j=  1 


/=  1 


m 


<  a:  a:  +  y]  a:  2 


-(P+/)a 


/=  1 


j=i 


<  2K(l-2~a)~l  2~pa 

<  2K(l-2~0,rl  (t-sf 


since  2  p  <  t  —  s.  This  completes  the  proof  of  Lemma  2.10. 
We  now  apply  Theorem  2.9  to  pre-Brownian  motion. 


□ 
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Corollary  2.11  Let  B  —  (Bt)t>o  be  a  pre-Brownian  motion.  The  process  B 
has  a  modification  whose  sample  paths  are  continuous,  and  even  locally  Holder 
continuous  with  exponent  ^  —  S  for  every  8  G  (0,  |). 

Proof  If  s  <  t,  the  random  variable  Bt  —  Bs  is  distributed  according  to  t  —  s), 
and  thus  Bt  —  Bs  has  the  same  law  as  y/t  —  s  U,  where  U  is  1).  Consequently, 

for  every  q  >  0, 


E[\Bt-Bs\q\  =  (t-s)q/2E[\U\q]  =  Cq  (t  -  s)q/2 

where  Cq  —  £[|t/|^]  <  oo.  Taking  q  >  2,  we  can  apply  Theorem  2.9  with  e  — 
|  —  1 .  It  follows  that  B  has  a  modification  whose  sample  paths  are  locally  Holder 
continuous  with  exponent  a  for  every  a  <  (q  —  2)/ (2 q).  If  q  is  large  we  can  take  a 
arbitrarily  close  to  | .  □ 

Definition  2.12  A  process  (Bt)t> o  is  a  Brownian  motion  if: 

(i)  (Bt)t> o  is  a  pre-Brownian  motion. 

(ii)  All  sample  paths  of  B  are  continuous. 

This  is  in  fact  the  definition  of  a  real  (or  linear)  Brownian  motion  started  from 
0.  Extensions  to  arbitrary  starting  points  and  to  higher  dimensions  will  be  discussed 
later. 

The  existence  of  Brownian  motion  in  the  sense  of  the  preceding  definition 
follows  from  Corollary  2.11.  Indeed,  starting  from  a  pre-Brownian  motion,  this 
corollary  provides  a  modification  with  continuous  sample  paths,  which  is  still  a 
pre-Brownian  motion.  In  what  follows  we  no  longer  consider  pre-Brownian  motion, 
as  we  will  be  interested  only  in  Brownian  motion. 

It  is  important  to  note  that  the  statement  of  Proposition  2.5  holds  without  change 
if  pre-Brownian  motion  is  replaced  everywhere  by  Brownian  motion.  Indeed,  with 
the  notation  of  this  proposition,  it  is  immediate  to  verify  that  —B,Bx,B^s)  have 
continuous  sample  paths  if  B  does. 

The  Wiener  measure.  Let  C(M+ ,  R)  be  the  space  of  all  continuous  functions  from 
M+  into  R.  We  equip  C(R+,  R)  with  the  a-field  ^  defined  as  the  smallest  a-field  on 
C(R+,  R)  for  which  the  coordinate  mappings  w  i->  w (t)  are  measurable  for  every 
t  >  0  (alternatively,  one  checks  that  coincides  with  the  Borel  a-field  on  C(M+ ,  R) 
associated  with  the  topology  of  uniform  convergence  on  every  compact  set).  Given 
a  Brownian  motion  B ,  we  can  consider  the  mapping 

T2  — >  C(R+ ,  R) 

(o  i — ^  ( 1 1 — ^  Bt(co)) 

and  one  verifies  that  this  mapping  is  measurable  (if  we  take  its  composition  with  a 
coordinate  map  w  i->  w  (t)  we  get  the  random  variable  Bt,  and  a  simple  argument 
shows  that  this  suffices  for  the  desired  measurability). 
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The  Wiener  measure  (or  law  of  Brownian  motion)  is  by  definition  the  image 
of  the  probability  measure  P(dco)  under  this  mapping.  The  Wiener  measure,  which 
we  denote  by  W(dw),  is  thus  a  probability  measure  on  C(R+,R),  and,  for  every 
measurable  subset  A  of  C(R+,  R),  we  have 

W(A)  =  P(B.  e  A), 

where  in  the  right-hand  side  B  stands  for  the  random  continuous  function  t  i-> 
We  can  specialize  the  last  equality  to  a  “cylinder  set”  of  the  form 


A  —  {w  e  C(R+,R)  :  w (r0)  €  A0,w(fi)  eAi . w(t„)  e  An}, 


where  0  =  to  <  t\  <  •  •  ■  <  tn,  and  Ao,A\, ...  ,An  e  &(M)  (recall  that  &(M)  stands 
for  the  Borel  a-field  on  R).  Corollary  2.4  then  gives 


W({w;w(t0)  e  A0,w(ri)  e  Ait . . .  ,w(t„)  e  An}) 
=  P(B,0  e  Ao,Btl  e  Ai,...,Btn  e  A„) 


=  1a0(0  )/ 

JA\  x— 


d.\  i  . . .  dx 


n 


n 


AiX-xA„  (2jr)n/2  yj t\(t2  —  tn- 1) 


exp 


/  (Xi  —  Xi-\Y  \ 
V  ^  2 (ti-ti-d)’ 


where  xo  =  0  by  convention. 

This  formula  for  the  W-measure  of  cylinder  sets  characterizes  the  probability 
measure  W.  Indeed,  the  class  of  cylinder  sets  is  stable  under  finite  intersections 
and  generates  the  a-field  ^ ,  which  by  a  standard  monotone  class  argument  (see 
Appendix  Al)  implies  that  a  probability  measure  on  ^  is  characterized  by  its  values 
on  this  class.  A  consequence  of  the  preceding  formula  for  the  W-measure  of  cylinder 
sets  is  the  (fortunate)  fact  that  the  definition  of  the  Wiener  measure  does  not  depend 
on  the  choice  of  the  Brownian  motion  B :  The  law  of  Brownian  motion  is  uniquely 
defined! 

Suppose  that  B'  is  another  Brownian  motion.  Then,  for  every  Ae^, 


P(B[  e  A)  =  W(A )  =  P(B .  g  A). 


This  means  that  the  probability  that  a  given  property  (corresponding  to  a  measurable 
subset  A  of  C(R+,R))  holds  is  the  same  for  the  sample  paths  of  B  and  for  the 
sample  paths  of  B' .  We  will  use  this  observation  many  times  in  what  follows  (see  in 
particular  the  second  part  of  the  proof  of  Proposition  2.14  below). 

Consider  now  the  special  choice  of  a  probability  space, 


Q  —  C(R+,R),  ^  P(  dw)  =  W(dw). 
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Then  on  this  probability  space,  the  so-called  canonical  process 

Xt(yt)  =  w  0) 

is  a  Brownian  motion  (the  continuity  of  sample  paths  is  obvious,  and  the  fact  that 
X  has  the  right  finite-dimensional  marginals  follows  from  the  preceding  formula  for 
the  W-measure  of  cylinder  sets).  This  is  the  canonical  construction  of  Brownian 
motion. 


2.3  Properties  of  Brownian  Sample  Paths 

In  this  section,  we  investigate  properties  of  sample  paths  of  Brownian  motion 
(Fig.  2.1).  We  fix  a  Brownian  motion  (Bt)t>Q.  For  every  t  >  0,  we  set 


=  a(Bs,s  <  t). 


Fig.  2.1  Simulation  of  a  Brownian  sample  path  on  the  time  interval  [0, 1] 
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Note  that  -¥s  C  ■¥,  if  .v  £  t.  We  also  set 

^°+  =  p)  &s- 

s>  0 


We  start  by  stating  a  useful  0—1  law. 

Theorem  2.13  (Blumenthal’s  zero-one  law)  The  G-field  ,#o+  A  trivial,  in  the 
sense  that  P(A )  =  0  <?r  1  for  every  A  G  J^o+- 

Proof  Let  0  <  0  <  4  <  •  <  4  and  let  g  :  — >  R  be  a  bounded  continuous 

function.  Also  fix  A  e  #o+  •  Then,  by  a  continuity  argument, 

£[1a  g(Bh , . . . ,  =  lim£[lA  g(Btl  -Bs,...,Btk-  Bs)]. 

If  0  <  s  <  t\ ,  the  variables  Bh  —  Be , . . . ,  Btk  —  Bs  are  independent  of  AFS  (by  the 
simple  Markov  property  of  Proposition  2.5)  and  thus  also  of  #o+-  It  follows  that 

E[\a  g(Bn B,k)\  =  lim  P(A )  E[g(Bn  -  Be, . . . ,  B,t  -  B,)\ 

=  P(A)  E[g(B, B,t)]. 

We  have  thus  obtained  that  J^o+  is  independent  of  a(Bt] , ...  ,Btk).  Since  this  holds 
for  any  finite  collection  {t\ , . . . ,  4}  of  (strictly)  positive  reals,  #0+  is  independent  of 
o(Bt,t  >  0).  However,  cr(Bt,t  >  0)  =  a(Bt,t  >  0)  since  Bo  is  the  pointwise  limit 
of  Bt  when  t  ->  0.  Since  #b+  C  cr(Bt,t  >  0),  we  conclude  that  #0+  is  independent 
of  itself,  which  yields  the  desired  result.  □ 

Proposition  2.14 

(i)  We  have,  a.s.  for  every  s  >  0, 

sup  Bs  >  0,  inf  Bs  <  0. 

0<S<£  0<S<£ 

(ii)  For  every  a  G  R,  let  Ta  —  inf{t  >  0  :  Bt  =  a}  (with  the  convention  inf  0  =  00). 
Then, 

a.s.,  Va  G  R,  Ta  <  00. 


Consequently,  we  have  a.s. 

lim  sup  B,  —  +00,  lim  inf  Bt  —  —00. 

t— >00  t—>oo 

Remark  It  is  not  a  priori  obvious  that  sup0<^<£  Bs  is  measurable,  since  this  is  an 
uncountable  supremum  of  random  variables.  However,  we  can  take  advantage  of  the 
continuity  of  sample  paths  to  restrict  the  supremum  to  rational  values  of  s  G  [0,e], 
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so  that  we  have  a  supremum  of  a  countable  collection  of  random  variables.  We  will 
implicitly  use  such  remarks  in  what  follows. 

Proof 

(i)  Let  (sp)  be  a  sequence  of  positive  reals  strictly  decreasing  to  0,  and  let 


sup  Bs  >  0 

0  <S<£p 


Since  this  is  a  monotone  decreasing  intersection,  it  easily  follows  that  A  is  ^im¬ 
measurable  (we  can  restrict  the  intersection  to  p  >  po,  for  any  choice  of  po  >  1). 
On  the  other  hand, 


P(A)  = 


lim 

p — >  oo 


sup  Bs  >  0 

0<.V<£/; 


and 


p(  sup  Bs  >  o)  >  P(BSp  >  0)  =  1 

'  0 <s<ep  J  2 

which  shows  that  P(A)  >  1/2.  By  Theorem  2.13  we  have  P(A)  =  1,  hence 

a.s.  Vs  >  0,  sup  Bs  >  0. 

0<s<s 


The  assertion  about  info<5<£  Bs  is  obtained  by  replacing  B  by  —B. 
(ii)  We  write 


1 


sup  Bs>  0 

0<.s<  1 


sup  Bs  > 

0<.s<  1 


and  we  use  the  scale  invariance  property  (see  Proposition  2.5  (ii)  and  the 
notation  of  this  proposition)  with  A  =  8  to  see  that,  for  every  8  >  0, 

p(  sup  Bs  >  8\  —  p(  sup  B8s  >  1^  =  p(  sup  Bs  >  lV 

In  the  second  equality,  we  use  the  remarks  following  the  definition  of  the 
Wiener  measure  to  observe  that  the  probability  of  the  event  {sup0<5<1  /$i  Bs  > 
1}  is  the  same  for  any  Brownian  motion  B.  If  we  let  8  go  to  0,  we  get 

p(  swpBs  >  1^  =  lim  \  p(  sup  Bs  >  1^  =  1. 

^^>0  '  HO  V  0<.s<  1  /82  ' 
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Then  another  scaling  argument  shows  that,  for  every  M  >  0, 


P 


and  replacing  B  by  —  B  we  have  also 


p(infBs  <  -m)  =  1. 

V  ,v>0  / 


The  assertions  in  (ii)  now  follow  easily.  For  the  last  one  we  observe  that  a 
continuous  function/  :  M+  — >  R  can  visit  all  reals  only  if  lim  sup t_^+OQf(t)  — 
+oo  and  lim  inf t^+OQf(t)  —  —  oo. 

□ 

Corollary  2.15  Almost  surely,  the  function  t  i->  Bt  is  not  monotone  on  any  non¬ 
trivial  interval. 

Proof  Using  assertion  (i)  of  Proposition  2.14  and  the  simple  Markov  property,  we 
immediately  get  that  a.s.  for  every  rational  q  e  Q+,  for  every  s  >  0, 


sup  Bt  >  Bq, 

q<t<q+s 


inf  B,  <  Bq. 

q<t<q+s 


The  desired  result  follows.  Notice  that  we  restricted  ourselves  to  rational  values  of  q 
in  order  to  throw  out  a  countable  union  of  negligible  sets  (and  by  the  way  the  result 
would  fail  if  we  had  considered  all  real  values  of  q).  □ 

Proposition  2.16  Let  0  =  <  f[  <  •  •  •  <  tpn  —  t  be  a  sequence  of  subdivisions  of 

[0,  t\  whose  mesh  tends  to  0  (i.e.  supj <i<Pn  —  tnt_ , )  —>  0  as  n  —>  oo).  Then, 

Pn 

iim 

n — ^  oo  ‘  J  1 

;=  1 


in  L 2 . 

Proof  This  is  an  immediate  consequence  of  Proposition  1.14,  writing  By  —By_x  — 
G((^_j ,  t}}]),  where  G  is  the  Gaussian  white  noise  associated  with  B.  □ 

If  a  <  b  and /  is  a  real  function  defined  on  [a,  b\,  the  function/  is  said  to  have 
infinite  variation  if  the  supremum  of  the  quantities  Yfi=  i  1 f(U)  —fih- 1)|,  over  all 
subdivisions  a  =  to  <  t\  <  •  •  •  <  tp  =  b,  is  infinite. 

Corollary  2.17  Almost  surely,  the  function  t  \->  Bt  has  infinite  variation  on  any 
non-trivial  interval. 

Proof  From  the  simple  Markov  property,  it  suffices  to  consider  the  interval  [0,  t\ 
for  some  fixed  t  >  0.  We  use  Proposition  2.16,  and  note  that  by  extracting  a 
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subsequence  we  may  assume  that  the  convergence  in  this  proposition  holds  a.s.  We 
then  observe  that 


- B'"- 1 


i=  1 


sup 

1  <i<Pn 


Bt 


The  supremum  inside  parentheses  tends  to  0  by  the  continuity  of  sample  paths, 
whereas  the  left-hand  side  tends  to  t  a.s.  It  follows  that  Yfi=i  I Bq  —  Bq  |  tends  to 
infinity  a.s.,  giving  the  desired  result.  □ 

The  previous  corollary  shows  that  it  is  not  possible  to  define  the  integral 
f^f(s)dBs  as  a  special  case  of  the  usual  (Stieltjes)  integral  with  respect  to  functions 
of  finite  variation  (see  Sect.  4.1.1  for  a  brief  presentation  of  the  integral  with  respect 
to  functions  of  finite  variation,  and  also  the  comments  at  the  end  of  Sect.  2.1). 


2.4  The  Strong  Markov  Property  of  Brownian  Motion 

Our  goal  is  to  extend  the  simple  Markov  property  (Proposition  2.5  (iii))  to  the  case 
where  the  deterministic  time  s  is  replaced  by  a  random  time  T.  We  first  need  to 
specify  the  class  of  admissible  random  times. 

As  in  the  previous  section,  we  fix  a  Brownian  motion  ( Bt)t>o .  We  keep  the 
notation  introduced  before  Theorem  2.13  and  we  also  set  J^oo  =  <j(Bs,s  >  0). 

Definition  2.18  A  random  variable  T  with  values  in  [0,  oo]  is  a  stopping  time  if,  for 
every  t  >  0,  {T  <  t)  e  &t. 

It  is  important  to  note  that  the  value  oo  is  allowed.  If  T  is  a  stopping  time,  we 
also  have,  for  every  t  >  0, 

{T<t}=  U  {T  <  q}  €  <?,. 

q€.  [0,?)  PlQ 

Examples  The  random  variables  T  —  t  (constant  stopping  time)  and  T  —  Ta  are 
stopping  times  (notice  that  {Ta  <  t}  =  {info<5</  \BS  —  a\  —  0}).  On  the  other  hand, 
T  —  sup{s  <  1  :  Bs  =  0}  is  not  a  stopping  time  (arguing  by  contradiction,  this  will 
follow  from  the  strong  Markov  property  stated  below  and  Proposition  2.14).  If  T  is 
a  stopping  time,  then,  for  every  t  >  0,  T  +  t  is  also  a  stopping  time. 

Definition  2.19  Let  T  be  a  stopping  time.  The  o -field  of  the  past  before  T  is 

=  {A  g  J^oo  :  Wt  >  0,  A  n  {T  <  t}  e 

It  is  easy  to  verify  that  is  indeed  a  a -field,  and  that  the  random  variable  T 
is  -measurable.  Moreover,  if  we  define  the  real  random  variable  1  {t<oo}Bt  by 
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setting 


l{r<oo}^r(&0 


Bt(co)(oj)  if  T (&>)  <  oo  , 

0  if  T (&>)  =  oo  , 


then  1  {tcoo}Bt  is  also  -measurable.  To  see  this,  we  first  observe  that 


oo 


oo 


l{r<oo}^r—  l{/2-'?< 


<r<(H-l)2-"}£>z'2 


—nxBn—n  — 


i=0 


=  lim  V  1 

w — >-oo  '  ^ 


{ r<  (i + 1 )  2~'7 } 1  { z2-"  <  r} D  a 


Bn— i 


i= 0 


We  then  note  that,  for  any  s  >  0,  Bs  l{5<r}  is  measurable,  because  if  A  is  a  Borel 

subset  of  R  not  containing  0  (the  case  where  0  e  A  is  treated  by  considering  the 
complementary  event)  we  have 


{bs1{s<t}  ^  A}  n  {r  < 


0  if  t  <  s 

{Bs  e  A}  fl  {s  <  T  <  if  £  >  s 


which  is  measurable  in  both  cases  (write  {s  <  T  <  t}  —  {T  <  t}  C\  {T  <  s}c). 

Theorem  2.20  (strong  Markov  property)  Let  T  be  a  stopping  time.  We  assume 
that  P(T  <  oo)  >  0  and  we  set,  for  every  t  >  0, 

B*P  =  l{r<oo}(^r+?  —  Bt ). 


Then  under  the  probability  measure  Pf 
Brownian  motion  independent  of  AFt- 


(T) 

T  <  oo),  the  process  ( B)  ) 


*>o  is  a 


Proof  We  first  consider  the  case  where  T  <  oo  a.s.  We  fix  A  e  and  0  <  t\  < 
^  tp->  and  we  let  F  be  a  bounded  continuous  function  from  W  into  M+ .  We  will 
verify  that 


E[  1A  F{B(y, b"])\  =  P(A )  E[F(Bt] Btp)\.  (2.1) 

The  different  assertions  of  the  theorem  then  follow.  First  the  case  A  —  T2  shows 

(T) 

that  the  process  ( B)  )t>o  has  the  same  finite-dimensional  marginal  distributions 
as  B  and  is  thus  a  Brownian  motion  (notice  that  the  sample  paths  of  B^T)  are 
continuous).  Then  (2.1)  implies  that,  for  every  choice  of  0  <  t\  <  •••  <  tp,  the 

vector  (By, B(P)  is  independent  of  and  it  follows  that  B(T)  is  independent 
of  JFT- 

Let  us  prove  (2.1).  For  every  integer  n  >  1,  and  for  every  real  t  >  0,  we  write  [t\n 
for  the  smallest  real  of  the  form  k2~n ,  with  k  e  Z+ ,  belonging  to  the  interval  [t,  oo). 
We  also  set  [oo]n  —  oo  by  convention.  In  order  to  prove  (2.1),  we  observe  that  we 
have  a.s. 


), 
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hence  by  dominated  convergence 

E[1aF(B^,...,B^)] 

=  lim  E\\a  ,  B^rl"))] 

n— >oo  1 

oo 

—  lim  >  E \\a  1  { (^— i ) 2— ” < r <k2~n } F (Bk2~n + 1\  ~  Bk2~n,  •  •  •  ,Bk2~n+tp  ~  Bk2~n)\, 

ft— >-00  '  “  1 

k= 0 


where  to  get  the  last  equality  we  have  decomposed  the  expectation  according  to  the 
possible  values  of  [T]n.  The  point  now  is  the  fact  that,  since  A  e  the  event 

A  n  {(£  —  1)2“"  <  T  <  k2~n}  =  (A  n  {T  <  k2~n})  n{T  <(k-  l)2~n}c 

is  ^k2~n  -measurable.  By  the  simple  Markov  property  (Proposition  2.5  (iii)),  we  have 
thus 


E[1  AC{(k  -\)2-n<T<k2-n}F(Bk2~n+ti  ~  Bk 2~n,  •  •  •  ,  Bk2~n+tp  ~  Bk2~n)\ 

=  P(A  n  {(k-  1)2“"  <  T  <  k2~n})  E[F(Btl , . . . ,  Btp)], 


and  we  just  have  to  sum  over  k  to  get  (2.1). 

Finally,  when  P(T  —  oo)  >  0,  the  same  arguments  give,  instead  of  (2.1), 

£[W«x>>  t%>)]  =  P(A  n{T<  oo})  E[F(Bti Btp)\ 


and  the  desired  result  again  follows  in  a  straightforward  way.  □ 

An  important  application  of  the  strong  Markov  property  is  the  “reflection 
principle”  that  leads  to  the  following  theorem. 

Theorem  2.21  For  every  t  >  0,  set  St  —  sup  s<tBs.  Then,  if  a  >  0  andb  G  (— oo,  a], 
we  have 


P(St  >  a,  Bt  <  h)  =  P(Bt  >  2 a  — b). 


In  particular,  St  has  the  same  distribution  as  | Bt 


Proof  We  apply  the  strong  Markov  property  at  the  stopping  time 


Ta  =  inf{t  >  0  \Bt  =  a}. 
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We  already  saw  (Proposition  2.14)  that  Ta  <  oo  a.s.  Then,  using  the  notation  of 
Theorem  2.20,  we  have 


P(S,  >a,B,<b)  =  P(Ta  <  t,  Bt  <  b)  =  P(Ta  <  t,  B™  <b-a ), 


since  B^da  =  B,  —  Bja  —  B,  —  a  on  the  event  {Ta  <  t}.  Write  B'  —  BtTa>,  so  that, 
by  Theorem  2.20,  the  process  B'  is  a  Brownian  motion  independent  of  ^ja  hence 
in  particular  of  Ta.  Since  B'  has  the  same  distribution  as  —  Bf ,  the  pair  ( Ta ,  Br)  also 
has  the  same  distribution  as  ( Ta ,  —  Br)  (this  common  distribution  is  just  the  product 
of  the  law  of  Ta  with  the  Wiener  measure).  Let 

H  —  {(51,  w)  G  M+  x  C(R+,  W)  :  s  <  t,  w (t  —  s)  <  b  —  a). 

The  preceding  probability  is  equal  to 

P((Ta,B')  G  H)  —  P((Ta,  -B')  G  H) 

=  P(Ta  <  t,  -B^l  <b- a) 

—  P(Ta  <t,Bt>2a-  b) 

=  P(B,  >2 a-b) 

because  the  event  {Bt  >  2a  —  b)  is  a.s.  contained  in  {Ta  <  /[.  This  gives  the  first 
assertion  (Fig.  2.2). 


Fig.  2.2  Illustration  of  the  reflection  principle:  the  conditional  probability,  knowing  that  {Ta  <  t}, 
that  the  graph  is  below  b  at  time  t  is  the  same  as  the  conditional  probability  that  the  graph  reflected 
at  level  a  after  time  Ta  (in  dashed  lines )  is  above  2a  —  b  at  time  t 
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For  the  last  assertion  of  the  theorem,  we  observe  that 


P(S,  >  a)  —  P(S,  >  a,  B,  >  a)  +  P(St  >  a,  B,  <  a)  —  2P(B,  >  a)  —  P(\B,\  >  a), 


and  the  desired  result  follows. 


□ 


It  follows  from  the  previous  theorem  that  the  law  of  the  pair  (St,  Bt )  has  density 


2(2  a-b)  (  (2  a-b)2 

g(a,  b)  =  — -j==—  exp  ( - — - )  l{a>o,b<o} 


\JlixF 


It 


(2.2) 


aA 


Corollary  2.22  For  every  a  >  0,  Ta  has  the  same  distribution  as  — z  and  has  density 


a  /  a  \  « 

/«)  =  ^=^exp(--)l 


\j2rvF 


{t>  0} 


Proof  Using  Theorem  2.21  in  the  second  equality,  we  have,  for  every  t  >  0, 

2 

P(Ta  <t)=  P(S,  >  a)  =  P(\B,\  >  a)  =  P(B2,  >  a2)  =  P(tB2  >  a2)  =  P(^  <  t). 

B\ 

Furthermore,  since  B\  is  distributed  according  to  yK(0,  1),  a  straightforward 
calculation  gives  the  density  of  a2 /B\.  □ 

Remark  From  the  form  of  the  density  of  Ta,  we  immediately  get  that  E[Ta]  —  oo. 

We  finally  extend  the  definition  of  Brownian  motion  to  the  case  of  an  arbitrary 
(possibly  random)  initial  value  and  to  any  dimension. 

Definition  2.23  If  Z  is  a  real  random  variable,  a  process  (W)r>o  is  a  real  Brownian 
motion  started  from  Z  if  we  can  write  Xt  =  Z  +  Bt  where  B  is  a  real  Brownian 
motion  started  from  0  and  is  independent  of  Z. 

Definition  2.24  A  random  process  B,  —  (B) , . . . , B dt)  with  values  in  W1  is  a  d- 
dimensional  Brownian  motion  started  from  0  if  its  components  Bl , . . . ,  Bd  are 
independent  real  Brownian  motions  started  from  0.  If  Z  a  random  variable  with 
values  in  Rd  and  Xt  =  (Xj , . . . ,  Xf)  is  a  process  with  values  in  Rd,  we  say  that  X  is 
a  d-dimensional  Brownian  motion  started  from  Z  if  we  can  write  Xt  =  Z  +  Bt  where 
B  is  a  d-dimensional  Brownian  motion  started  from  0  and  is  independent  of  Z. 

Note  that,  if  X  is  a  d-dimensional  Brownian  motion  and  the  initial  value  of  X  is 
random,  the  components  of  X  may  not  be  independent  because  the  initial  value  may 
introduce  some  dependence  (this  does  not  occur  if  the  initial  value  is  deterministic). 

In  a  way  similar  to  the  end  of  Sect.  2.2,  the  Wiener  measure  in  dimension 
d  is  defined  as  the  probability  measure  on  C(M+,RJ)  which  is  the  law  of  a 
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d-dimensional  Brownian  motion  started  from  0.  The  canonical  construction  of 
Sect.  2.2  also  applies  to  d-dimensional  Brownian  motion. 

Many  of  the  preceding  results  can  be  extended  to  d-dimensional  Brownian 
motion  with  an  arbitrary  starting  point.  In  particular,  the  invariance  properties  of 
Proposition  2.5  still  hold  with  the  obvious  adaptations.  Furthermore,  property  (i) 
of  this  proposition  can  be  extended  as  follows.  If  X  is  a  d-dimensional  Brownian 
motion  and  0  is  an  isometry  of  Rd,  the  process  (@(Xt))t>o  is  still  a  d-dimensional 
Brownian  motion.  The  construction  of  the  Wiener  measure  and  Blumenthal’s  zero- 
one  law  are  easily  extended,  and  the  strong  Markov  property  also  holds:  One  can 
adapt  the  proof  of  Theorem  2.20  to  show  that,  if  T  is  a  stopping  time  -  in  the  sense  of 
the  obvious  extension  of  Definition  2.18  -  which  is  finite  with  positive  probability, 
then  under  the  probability  measure  P(-  \  T  <  oo),  the  process  Xt  —  XT+t  =  W, 
t  >  0,  is  a  d-dimensional  Brownian  motion  started  from  0  and  is  independent  of 


Exercises 


In  all  exercises  below,  (Bt)t>o  Is  a  real  Brownian  motion  started  from  0,  and  St  — 
suPo  <s<tBs- 

Exercise  2.25  (Time  inversion) 


1.  Show  that  the  process  (W^)f>o  defined  by  Wo  =  0  and  Wt  =  tB\/t  for  t  >  0  is 
(indistinguishable  of)  a  real  Brownian  motion  started  from  0.  (Hint:  First  verify 
that  W  is  a  pre-Brownian  motion.) 


Bt 

2.  Infer  that  lim  —  =  0  a.s 


-oo 


t 


Exercise  2.26  For  every  real  a  >  0,  we  set  Ta  —  inf {t  >  0  :  Bt  —  a}.  Show  that  the 
process  (Ta)a> o  has  stationary  independent  increments,  in  the  sense  that,  for  every 
0  <  a  <  b,  the  variable  T &  —  Ta  is  independent  of  the  a-field  o(Tc,Q  <  c  <  a)  and 
has  the  same  distribution  as  Tb-a. 


Exercise  2.27  (Brownian  bridge) 

We  set  Wt  —  Bt  —  tB\  for  every  t  e  [0,  1]. 

1.  Show  that  (W;Xe[o,i]  is  a  centered  Gaussian  process  and  give  its  covariance 
function. 

2.  Let  0  <  t\  <  t2  <  •  •  •  <  tp  <  1.  Show  that  the  law  of  (Wt] ,  Wt2, . . . ,  Wtp)  has 
density 


g(x  1, .  ..,xp)=  \[2n  ph  (xi)p,2-tl  (x2  -  Xi)  •  •  -Ptp-tp-i  (xp  ~  xp-i)pi-tp(-xp ), 


where pt(x)  —  ~^=  exp(— v2/2 1).  Explain  why  the  law  of  (Wtl ,  Wt2 , ,  Wtp)  can 
be  interpreted  as  the  conditional  law  of  (Bh ,  BT2, ... ,  Btp)  knowing  that  B\  —  0. 
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3.  Verify  that  the  two  processes  (Wt)te[o,i\  and  (Wi-^^i]  have  the  same  distribu¬ 
tion  (similarly  as  in  the  definition  of  Wiener  measure,  this  law  is  a  probability 
measure  on  the  space  of  all  continuous  functions  from  [0,  1]  into  R). 

Exercise  2.28  ( Local  maxima  of  Brownian  paths)  Show  that,  a.s.,  the  local 
maxima  of  Brownian  motion  are  distinct:  a.s.,  for  any  choice  of  the  rational  numbers 
p,  q,  r,s  >  0  such  that  p  <  q  <  r  <  s  we  have 

sup  B,  ^  sup  Bt  . 

p<t<q  r<t<s 


Exercise  2.29  {Non- differentiability)  Using  the  zero-one  law,  show  that,  a.s. 


r  B* 

hm  sup  — —  =  +oo 

40  v  t 


Bt 

liminf  — —  =  — oo  . 

40  \ft 


Infer  that,  for  every  s  >  0,  the  function  t  Bt  has  a.s.  no  right  derivative  at  s. 

Exercise  2.30  {Zero  set  of  Brownian  motion)  Let  H  {t  e  [0, 1]  :  Bt  =  0}.  Using 
Proposition  2.14  and  the  strong  Markov  property,  show  that  H  is  a.s.  a  compact 
subset  of  [0, 1]  with  no  isolated  point  and  zero  Lebesgue  measure. 

Exercise  2.31  {Time  reversal)  We  set  B't  —  B\  —B\-t  for  every  t  e  [0, 1].  Show  that 
the  two  processes  (5f)fe[o,i]  and  {B't)te[ o,i]  have  the  same  law  (as  in  the  definition  of 
Wiener  measure,  this  law  is  a  probability  measure  on  the  space  of  all  continuous 
functions  from  [0,  1]  into  R). 

Exercise  2.32  {Arcsine  law) 

Set  T  :=  inf{t  >0:Bt  =  Si}. 

1.  Show  that  T  <  1  a.s.  (one  may  use  the  result  of  the  previous  exercise)  and  then 
that  T  is  not  a  stopping  time. 

2.  Verify  that  the  three  variables  St9  St  —  Bt  and  \Bt  \  have  the  same  law. 

3.  Show  that  T  is  distributed  according  to  the  so-called  arcsine  law,  whose  density 
is 


1 

7ty/t(l—  t) 


1(0,1)  (0- 


4.  Show  that  the  results  of  questions  1.  and  3.  remain  valid  if  T  is  replaced  by 
L  sup{t  <1:^  =  0}. 

Exercise  2.33  {Law  of  the  iterated  logarithm) 

The  goal  of  the  exercise  is  to  prove  that 

Bt 

lim  sup  .  -  =  1  a.s. 

t — yoo  V 2*  log  log* 
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We  set  h(t )  =  *J2t  log  log  t. 

1.  Show  that,  for  every  t  >  0,  P(S,  >  Usft)  ~  —  e  M  ^2,  when  ->  +oo. 

u\J2n 

2.  Let  r  and  c  be  two  real  numbers  such  that  1  <  r  <  c2.  From  the  behavior  of  the 
probabilities  P(Srn  >  ch(rn~1))  when  n  ->  oo,  infer  that,  a.s., 


Bt 

lim  sup  —  <  1 . 

?-+oo  v^loglog^ 


3.  Show  that  a.s.  there  are  infinitely  many  values  of  n  such  that 


By-n  Brn  —  1  ^ 


r  —  1 


h(rn) 


Conclude  that  the  statement  given  at  the  beginning  of  the  exercise  holds. 

Bt 

4.  What  is  the  value  of  lim  inf 


? 


^/2t\og  1  ogt 


Notes  and  Comments 

The  first  rigorous  mathematical  construction  of  Brownian  motion  is  due  to 
Wiener  [81]  in  1923.  We  use  the  nonstandard  terminology  of  “pre-Brownian 
motion”  to  emphasize  the  necessity  of  choosing  an  appropriate  modification  in 
order  to  get  a  random  process  with  continuous  sample  paths.  There  are  several 
constructions  of  Brownian  motion  from  a  sequence  of  independent  Gaussian 
random  variables  that  directly  yield  the  continuity  property,  and  a  very  elegant 
one  is  Levy’s  construction  (see  Exercise  1.18),  which  can  be  found  in  the  books 
[49]  or  [62].  Levy’s  construction  avoids  the  use  of  Kolmogorov’s  lemma,  but  the 
latter  will  have  other  applications  in  this  book.  We  refer  to  Talagrand’s  book  [78] 
for  far-reaching  refinements  of  the  “chaining  method”  used  above  in  the  proof  of 
Kolmogorov’s  lemma.  Much  of  what  we  know  about  linear  Brownian  motion  comes 
from  Levy,  see  in  particular  [54,  Chapitre  VI].  Perhaps  surprisingly,  the  strong 
Markov  property  of  Brownian  motion  was  proved  only  in  the  1950s  by  Hunt  [32] 
(see  also  Dynkin  [19]  for  a  more  general  version  obtained  independently  of  Hunt’s 
work),  but  it  had  been  used  before  by  other  authors,  in  particular  by  Levy  [54], 
without  a  precise  justification.  The  reflection  principle  and  its  consequences  already 
appeared,  long  before  Brownian  motion  was  rigorously  constructed,  in  Bachelier’s 
thesis  [2],  which  was  a  pioneering  work  in  financial  mathematics.  The  book  [62] 
by  Morters  and  Peres  is  an  excellent  source  for  various  sample  path  properties  of 
Brownian  motion. 


Chapter  3 

Filtrations  and  Martingales 


In  this  chapter,  we  provide  a  short  introduction  to  the  theory  of  continuous  time 
random  processes  on  a  filtered  probability  space.  On  the  way,  we  generalize  several 
notions  introduced  in  the  previous  chapter  in  the  framework  of  Brownian  motion, 
and  we  provide  a  thorough  discussion  of  stopping  times.  In  a  second  step,  we 
develop  the  theory  of  continuous  time  martingales,  and,  in  particular,  we  derive 
regularity  results  for  sample  paths  of  martingales.  We  finally  discuss  the  optional 
stopping  theorem  for  martingales  and  supermartingales,  and  we  give  applications  to 
explicit  calculations  of  distributions  related  to  Brownian  motion. 


3.1  Filtrations  and  Processes 

Throughout  this  chapter,  we  consider  a  probability  space  (T2,  P).  In  this  section, 

we  introduce  some  general  notions  that  will  be  of  constant  use  later. 

Definition  3.1  A  filtration  on  (C2,^,P)  is  a  collection  indexed  by 

[0,  oo]  of  sub-a-fields  of  such  that  c  f°r  every  s  <  t  <  o o. 

We  have  thus,  for  every  0  <  s  <  t, 

^oC^C^C^ooC^. 

We  also  say  that  (J^),  P)  is  a  filtered  probability  space 

Example  If  B  is  a  Brownian  motion,  we  considered  in  Chap.  2  the  filtration 

&t  =  cr(Bs,0  <s<t ),  J^oo  =  cr(Bs,s  >  0). 
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More  generally,  if  X  =  (Xt,  t  >  0)  is  any  random  process  indexed  by  R+,  the 
canonical  filtration  of  X  is  defined  by  —  a(Xy,0  <  s  <  t)  and  = 
ct(XS9s  >  0). 

Let  (^t)o<t<oo  be  a  filtration  on  (£2,  P).  We  set,  for  every  t  >  0 

^ , 

s>t 

and  JW  =  Note  that  ^  c  for  every  t  G  [0,  oo].  The  collection 

(J^+)0</<oo  is  also  a  filtration.  We  say  that  the  filtration  (J^i)  is  right-continuous  if 

Vt  >  0. 

By  construction,  the  filtration  (&t+)  is  right-continuous. 

Let  be  a  filtration  and  let  be  the  class  of  all  (^oo,  P) -negligible  sets  (i.e. 
A  e  /  if  there  exists  an  A'  G  J^oo  such  A  cA;  and  P(A')  =  0).  The  filtration  is 
said  to  be  complete  if  JY  C  (and  thus  /c^  for  every  t). 

If  (J^)  is  not  complete,  it  can  be  completed  by  setting  v  a(t/L),  for 

every  t  g  [0,  oo],  using  the  notation  vcr(yY)  for  the  smallest  a-field  that  contains 
both  and  <j(sY)  (recall  that  cr(yY)  is  the  a-field  generated  by  JY).  We  will 
often  apply  this  completion  procedure  to  the  canonical  filtration  of  a  random  process 
pQ*>o,  and  call  the  resulting  filtration  the  completed  canonical  filtration  of  X.  The 
reader  will  easily  check  that  all  results  stated  in  Chap.  2,  where  we  were  considering 
the  canonical  filtration  of  a  Brownian  motion  B,  remain  valid  if  instead  we  deal 
with  the  completed  canonical  filtration.  The  point  is  that  augmenting  a  a-field  with 
negligible  sets  does  not  alter  independence  properties. 

Let  us  turn  to  random  processes,  which  in  this  chapter  will  be  indexed  by  M+ . 

Definition  3.2  A  process  X  —  {Xt)t>o  with  values  in  a  measurable  space  ( E ,  S)  is 
said  to  be  measurable  if  the  mapping 

(&) ,  t)  i — >  Xf{cci) 

defined  on  £2  x  M+  equipped  with  the  product  a-field  &  ®  <^(R+)  is  measurable. 
(We  recall  that  «^(R+)  stands  for  the  Borel  a-field  of  R.) 

This  is  stronger  than  saying  that,  for  every  t  >  0,  Xt  is  ^-measurable.  On  the 
other  hand,  considering  for  instance  the  case  where  E  =  R,  it  is  easy  to  see  that 
if  the  sample  paths  of  X  are  continuous,  or  only  right-continuous,  the  fact  that  Xt 
is  ^-measurable  for  every  t  implies  that  the  process  is  measurable  in  the  previous 
sense  -  see  the  argument  in  the  proof  of  Proposition  3.4  below. 

In  the  remaining  part  of  this  chapter,  we  fix  a  filtration  (J^i)  on  (X2,  J^,P), 
and  the  notions  that  will  be  introduced  depend  on  the  choice  of  this  filtration. 

Definition  3.3  A  random  process  (Xt)t>o  with  values  in  a  measurable  space  (E,  S) 
is  called  adapted  if,  for  every  t  >  0,  Xt  is  ^-measurable.  This  process  is  said  to  be 
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progressive  if,  for  every  t  >  0,  the  mapping 

(co,s)  i->  Xs (co) 

defined  on  Q  x  [0,  t\  is  measurable  for  the  a -field  XX t  (g)  X$([0,  t]). 

Note  that  a  progressive  process  is  both  adapted  and  measurable  (saying  that  a 
process  is  measurable  is  equivalent  to  saying  that,  for  every  t  >  0,  the  mapping 
(co,s)  i->  Xs(co)  defined  on  Q  x  [0,  t\  is  measurable  for  XX  (g)  XX  Iff),  t])). 

Proposition  3.4  Let  (X^>o  be  a  random  process  with  values  in  a  metric  space 
( E,d )  (equipped  with  its  Borel  o -field).  Suppose  that  X  is  adapted  and  that  the 
sample  paths  ofX  are  right -continuous  (i.e.for  every  co  e  Q,  t  i->  Xt(oo)  is  right- 
continuous).  ThenX  is  progressive.  The  same  conclusion  holds  if  one  replaces  right- 
continuous  by  left-continuous. 

Proof  We  treat  only  the  case  of  right-continuous  sample  paths,  as  the  other  case  is 
similar.  Fix  t  >  0.  For  every  n  >  1  and  s  e  [0,  t],  define  a  random  variable  Xns  by 
setting 


X "  —Xkt/n  if  s  e  [(k-  l)t/n,  kt/n),  k  e  {l, . . .  ,n}, 

and  X"  —  X,.  The  right-continuity  of  sample  paths  ensures  that,  for  every  s  e  [0,  t\ 
and  a >  e 


Xs(co)  =  lim  XnX(o). 

n—>o o 

On  the  other  hand,  for  every  Borel  subset  A  of  E, 


{(m,  s)  e  £2  x  [0,  t ]  :  X"(cu)  e  A}  =  ({X,  e  A}  x  {t}) 


n 


U(U(!Xw,e,  ,x«|))) 


^=1 


which  belongs  to  the  a-field  <g>  ^([0,  t]).  Hence,  for  every  n  >  1,  the  mapping 
(u;,^)  i-^  Xns(<jL>),  defined  on  T2  x  [0,  t\,  is  measurable  for  XX t  (g)  ^([0,  t]).  Since  a 
pointwise  limit  of  measurable  functions  is  also  measurable,  the  same  measurability 
property  holds  for  the  mapping  (co,  s)  \->  Xs(co )  defined  on  X2  x  [0,  t\.  It  follows  that 
the  process  X  is  progressive.  □ 

The  progressive  a-field  The  collection  XX  of  all  sets  «^(R+)  such  that 

the  process  Xt(co)  =  1  a  (to,  t)  is  progressive  forms  a  a-field  on  X2  x  R+,  which  is 
called  the  progressive  a-field.  A  subset  A  of  X2  x  M+  belongs  to  XX  if  and  only  if, 
for  every  t  >  0,  A  D  (X2  x  [0,  t])  belongs  to  XXt  <g>  Xd([0,  t]). 

One  then  verifies  that  a  process  X  is  progressive  if  and  only  if  the  mapping 
(co,  t)  i-^  Xt(co)  is  measurable  on  X2  x  M+  equipped  with  the  a-field  XX. 
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3.2  Stopping  Times  and  Associated  a  -Fields 

In  this  section,  we  extend  certain  notions  that  were  already  introduced  in  the 
previous  chapter  in  the  framework  of  Brownian  motion.  The  following  definition 
is  just  a  repetition  of  the  corresponding  one  in  the  previous  chapter. 

Definition  3.5  A  random  variable  T  :  Q  — >  [0,  oo]  is  a  stopping  time  of  the 
filtration  (J^)  if  {T  <  *}  G  for  every  t  >  0.  The  o -field  of  the  past  before  T  is 
then  defined  by 


&T  =  {A  g  ^oo  :  Vf  >  0,  A  n  {T  <  t)  G 
The  reader  will  verify  that  is  indeed  a  a -field. 

In  what  follows,  “stopping  time”  will  mean  stopping  time  of  the  filtration  (J^) 
unless  otherwise  specified.  If  T  is  a  stopping  time,  we  also  have  {T  <  t}  G  for 
every  t  >  0,  by  the  same  argument  as  in  the  Brownian  case,  and  moreover 

oo 

{T=oo}=  ((J{r<n})%^ oo. 

n=  1 


Recall  the  definition  of  the  filtration  (J^+).  A  stopping  time  (of  the  filtration 
(J^))  is  obviously  also  a  stopping  time  of  the  filtration  (J^+),  but  the  converse 
need  not  be  true  in  general. 

Proposition  3.6  Write  &t  —  t+for  every  t  G  [0,  oo]. 

(i)  A  random  variable  T  :  T2  — >  [0,  oo]  is  a  stopping  time  of  the  filtration  (&t)  if 
and  only  if  {T  <  t}  G  for  every  t  >  0.  This  is  also  equivalent  to  saying  that 
T  At  is  ^-measurable  for  every  t  >  0. 

(ii)  Let  T  be  a  stopping  time  of  the  filtration  (&t).  Then 

=  {A  g  :  Wt  >  0,  A  n  {T  <  t}  g 


We  will  write 


Proof 

(i)  Suppose  that  T  is  a  stopping  time  of  the  filtration  (%).  Then,  for  every  t  >  0, 

{T  <  t}  —  |J  {T<q}e&, 

qeQ+,q<t 
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because  {T  <  q}  e  C  if  q  <  t.  Conversely,  assume  that  {T  <  t}  e  APt 
for  every  t  >  0.  Then,  for  every  t  >  0  and  s  >  t, 

{T<t}=  f)  {T<q}e&, 

^rGQ_|_,  t<q<s 


and  it  follows  that  {T  <  t}  e  APt+  =  %. 

Then,  saying  that  T  A  t  is  ^-measurable  for  every  t  >  0  is  equivalent  to 
saying  that,  for  every  s  <  t,  {T  <  s}  e  ^t.  Taking  a  sequence  of  values  of  s  that 
increases  to  t,  we  see  that  the  latter  property  implies  that  {T  <  t}  e  and  so 
T  is  a  stopping  time  of  the  filtration  (A?t) .  Conversely,  if  T  is  a  stopping  time  of 
the  filtration  (Sf,),  we  have  {T  <  s}  e  A?s  C  whenever  s  <  t,  and  thus  T  At 
is  ^-measurable. 

(ii)  First,  if  A  e  we  have  A  n  {T  <  t}  e  %  for  every  t  >  0.  Hence,  for  t  >  0, 

a  n  {t  <  t}  =  U  (An{r<?})e^( 

qeQ+,q<t 


since  A  P  {T  <  q)  G  A?q  C  for  every  q  <  t. 

Conversely,  assume  that  A  D  {T  <  t}  e  Afit  for  every  t  >  0.  Then,  for  every 
t  >  0,  and  s  >  t, 

A  n  {T  <  t}  =  P)  (a  n  {T  <  <?})  e  &s. 

qEQ-\-,t<q<s 


In  this  way,  we  get  that  A  Pi  {T  <  t}  e  APt+  =  %  and  thus  A  e&r- 


□ 


Properties  of  stopping  times  and  of  the  associated  a  -fields 

(a)  For  every  stopping  time  T,  we  have  Afij  C  •  If  the  filtration  ( is  right- 
continuous,  we  have  &t+  — 

(b)  If  T  —  t  is  a  constant  stopping  time,  APj  —  Afit  and  &t+  —  APt+  • 

(c)  Let  T  be  a  stopping  time.  Then  T  is  -measurable. 

(d)  Let  T  be  a  stopping  time  and  A  e  Set 


T  (< co )  if  co  e  A  , 
Too  if  co  £  A  . 


Then  A  e  APt  if  and  only  if  TA  is  a  stopping  time. 

(e)  Let  S,  T  be  two  stopping  times  such  that  S  <  T.  Then  A^s  C  AAj  and  A^s+  C 
^r+. 

(f)  Let  S ,  T  be  two  stopping  times.  Then,  S  v  T  and  S  aT  are  also  stopping  times 

and  Cl  Furthermore,  {S  <  T}  e  A^sat  and  {S  —  T}  e  A^sat- 
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(g)  If  (Sn)  is  a  monotone  increasing  sequence  of  stopping  times,  then  S  —  lim  \  Sn 
is  also  a  stopping  time. 

(h)  If  ( Sn )  is  a  monotone  decreasing  sequence  of  stopping  times,  then  S  =  lim  |  Sn 
is  a  stopping  time  of  the  filtration  (#)+),  and 

A+  —  P)  An+- 

n 

(i)  If  (Sn)  is  a  monotone  decreasing  sequence  of  stopping  times,  which  is  also 
stationary  (in  the  sense  that,  for  every  co ,  there  exists  an  integer  N(co)  such 
that  Sn(co)  —  S(co)  for  every  n  >  N(co))  then  S  =  lim  |  Sn  is  also  a  stopping 
time,  and 


(j)  Let  T  be  a  stopping  time.  A  function  co  i->  Y(co )  defined  on  the  set  {T  <  00} 
and  taking  values  in  the  measurable  set  (E,  S’)  is  J^-measurable  if  and  only  if, 
for  every  t  >  0,  the  restriction  of  Y  to  the  set  {T  <  t)  is  measurable. 

Remark  In  property  (j)  we  use  the  (obvious)  notion  of  -measurability  for  a 
random  variable  co  i->  Y(co)  that  is  defined  only  on  a  ^-measurable  subset  of  Q 
(here  ^  is  a  a-field  on  £?).  This  notion  will  be  used  again  in  Theorem  3.7  below. 

Proof  (a),  (b)  and  (c)  are  almost  immediate  from  our  definitions.  Let  us  prove  the 
other  statements. 

(d)  For  every  t  >  0, 


{ta  <  t}  =  a  n  {t  <  ?} 

and  the  result  follows  from  the  definition  of 

(e)  It  is  enough  to  prove  that  C  If  A  e  ^5,  we  have 

A  n  {T  <  t)  =  (A  n  {5  <  *})  n  {r  <  t}  e  &u 


hence  A  e 

(f)  We  have 


{S  A  T  <  t)  =  {S  <  t}  U  {T  <  t)  G 
{5  v  T  <  =  {5  <  t}  n  {T  <  t}  G 

so  that  S  A  T  and  S  v  T  are  stopping  times. 

It  follows  from  (e)  that  Y^sat  C  (J^s  Pi  J^).  Moreover,  if  A  e  ^  FI  #7, 

a  n  {s  a  t  <  t}  =  (A  n  {5  <  *})  u  (A  n  {r  <  *})  e  &u 


hence  A  e  ^sat- 
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Then,  for  every  t  >  0, 

{S  <  T}  n  {T  <  t}  =  {S  <  t}  n  {T  <  t}  n  {S  A  t  <  T  A  t}  e 
{S  <  T}  n  {S  <  t}  =  {5  A  t  <  T  A  t}  n  {S  <  t}  e  &u 

because  S  At  and  T  A  t  are  both  ^-measurable  by  Proposition  3.6  (i).  It  follows 
that  {S<T}  =  JW  Then  {5  =  T}  =  {S  <  T}  n  {T  <  5}. 

(g)  For  every  t  >  0, 


{5  <  t}  =  <  t)  e  ft. 

n 


(h)  Similarly 


{S<t}  =  |J{5„  <t}e  ftt, 

n 


and  we  use  Proposition  3.6  (i).  Then,  by  (e),  we  have  &$+  C  &sn+  for  every  n , 
and  conversely,  if  A  g  ^s„+, 

a  n  {5  <  t}  =  (J(A  n  {s„  <  t})  €  ft„ 


hence  A  G  &s+- 

(i)  In  that  case,  we  also  have 

{S<t}  =  (J {Sn  <t}e  ft„ 

n 


and  if  A  e  f|  nA„, 


a  n  {s  <  t)  =  y  (A  n  {s„  <  /})  G  ft„ 

n 


so  that  A  G  &s- 

(j)  First  assume  that,  for  every  t  >  0,  the  restriction  of  Y  to  {T  <  t}  is  &t- 
measurable.  Then,  for  every  measurable  subset  A  of  E, 

{Y  G  A}  fl  {T  <  t)  G  &t. 

Letting  t  ->  oo,  we  first  obtain  that  {T  G  A}  G  J^o,  and  then  we  deduce  from 
the  previous  display  that  {y  G  A}  G  &T. 

Conversely,  if  Y  is  -measurable,  {y  G  A}  G  and  thus  {y  G  A}  fl  {T 
t)  G  giving  the  desired  result. 


VI  □ 
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Theorem  3.7  Let  (Xt)t> o  be  a  progressive  process  with  values  in  a  measurable 
space  (E,^),  and  let  T  be  a  stopping  time.  Then  the  function  co  i->  XT(co)  :  = 
which  is  defined  on  the  event  {T  <  oo},  is  -measurable. 

Proof  We  use  property  (j)  above.  Let  t  >  0.  The  restriction  to  {T  <  t}  of  the 
function  co  i->  XT(co )  is  the  composition  of  the  two  mappings 

{ T  <  3  CO  I — >  (u),  T(co)  A  f) 

&t  0  ^([0,  f]) 


and 


T2  x  [0,  t]  3  (&) ,  ,s)  i — ^  JL  (uf) 

&t  0  ^([0,  f]) 


which  are  both  measurable  (the  first  one  since  T  A  t  is  measurable,  by 
Proposition  3.6  (i),  and  the  second  one  by  the  definition  of  a  progressive  process).  It 
follows  that  the  restriction  to  {T  <  t}  of  the  function  co  i->  XT{co)  is  ^-measurable, 
which  gives  the  desired  result  by  property  ( j).  □ 

Proposition  3.8  Let  T  be  a  stopping  time  and  let  S  be  an  -measurable  random 
variable  with  values  in  [0,  oo],  such  that  S  >  T.  Then  S  is  also  a  stopping  time. 

In  particular,  if  T  is  a  stopping  time, 


oo 


Tn  =  £ 


k+  1 


>n 


1{&2— ”<r< A+i)2— +  °°  *  l{r=oo}  » 


k=0 


defines  a  sequence  of  stopping  times  that  decreases  to  T. 
Proof  For  the  first  assertion,  we  write,  for  every  t  >  0, 


n  —  0, 1, 2, . . . 


{S<t}  =  {S<t}n{T<t}e 

since  {S  <  t}  is  measurable.  The  second  assertion  follows  since  Tn  >  T,  and 
is  a  function  of  T,  hence  -measurable,  and  |  T  as  n  \  oo  by  construction.  □ 

The  following  proposition  will  be  our  main  tool  to  construct  stopping  times 
associated  with  random  processes. 

Proposition  3.9  Let  (Xt)t> o  be  an  adapted  process  with  values  in  a  metric  space 
(E,  d). 

(i)  Assume  that  the  sample  paths  of  X  are  right -continuous,  and  let  O  be  an  open 
subset  of  E.  Then 


T0  =  inf {t  >  0  :XteO} 


is  a  stopping  time  of  the  filtration  (i£+). 
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(ii)  Assume  that  the  sample  paths  ofX  are  continuous,  and  let  F  be  a  closed  subset 
of  E.  Then 


Tf  =  inf{t  >  0  :XteF} 

is  a  stopping  time  (of  the  filtration  (jZ)) ). 

Proof 

(i)  For  every  t  >  0, 


{T0  <t}=  (J  {X,  e  O}  e  &t, 

j'G  [0,9  nQ 


and  we  use  Proposition  3.6  (i). 
(ii)  For  every  t  >  0, 


{TF<t}  =  j  inf  d(Xs,F) 

(  0 <s<t 


=  ol  =  f  inf  J(Xs,F)  =  o} 
)  i  j€[o,dnQ  ) 


□ 


3.3  Continuous  Time  Martingales  and  Supermartingales 

Recall  that  we  have  fixed  a  filtered  probability  space  (X2,  P).  In  the 

remaining  part  of  this  chapter,  all  processes  take  values  in  R.  The  following  is  an 
obvious  analog  of  the  corresponding  definition  in  discrete  time  (see  Appendix  A2 
below). 

Definition  3.10  An  adapted  real- valued  process  (Xt)t>o  such  that  Xt  e  L1  for  every 
t  >  0  is  called 

•  a  martingale  if,  for  every  0  <  s  <  t,  E[Xt  |  &s\  =  X,; 

•  a  supermartingale  if,  for  every  0  <  s  <  t,  E[Xt  \  TFS\  <  Xs\ 

•  a  submartingale  if,  for  every  0  <  ^  <  t,  E[Xt  \  &s]  >  Xs. 

If  (Xt)t>o  is  a  submartingale,  (— Xt)t>o  is  a  supermartingale.  For  this  reason,  some 
of  the  results  below  are  stated  for  supermartingales  only,  but  the  analogous  results 
for  submartingales  immediately  follow. 

If  (Xt)t> o  is  a  martingale  (resp.  a  supermartingale,  resp.  a  submartingale),  we 
ha ve£[XJ  =  E[Xt\  (resp.  E[XS\  >  E[Xt],  resp.  E[XS\  <  E[Xt])  whenever  0  <  s  <  t. 

A  simple  way  to  construct  a  martingale  is  to  take  a  random  variable  Z  e  L1  and 
to  set  Xr  —  E[Z  |  for  every  t  >  0.  Not  all  martingales  are  of  this  type,  however. 
Let  us  turn  to  an  important  class  of  examples. 


50 


3  Filtrations  and  Martingales 


Important  example  We  say  that  a  process  (Zt)t> o  with  values  in  R  or  in  Wl  has 
independent  increments  with  respect  to  the  filtration  (^t)  if  Z  is  adapted  and  if,  for 
every  0  <  s  <  t,  Zt  —  Zs  is  independent  of  (for  instance,  a  Brownian  motion  has 
independent  increments  with  respect  to  its  canonical  filtration).  If  Z  is  a  real- valued 
process  having  independent  increments  with  respect  to  (#)),  then 

(i)  if  Zt  G  L1  for  every  t  >  0,  then  Zt  —  Zt  —  E[Zt\  is  a  martingale; 

(ii)  if  Zt  G  L2  for  every  t  >  0,  then  Yt  —  Z2  —  E[Z 2]  is  a  martingale; 

(iii)  if,  for  some  0  G  M,  we  have  £[e0Zr]  <  oo  for  every  t  >  0,  then 

Q0Zt 

Xt  ~  E[eez>] 


is  a  martingale. 

Proofs  of  these  facts  are  very  easy.  In  the  second  case,  we  have  for  every  0  <  s  <  t, 

E[(Z,)2  |  &s]  =  E[(%  +  Zt-  Zsf  |  &s] 

=  Z2+  2 %E\Zt  -  %  |  &,]  +  E[{Z,  -  Zsf  | 

=  Z2+E[(%-Zs)2} 

=  Z2+  E[Z2]  -  2 E[ZsZt]  +  E[Z2s] 

=  Z2+  E[Z;}  -  E\Z% 

because  E[ZsZt\  —  E[ZsE[Zt  \  ZPS]\  —  E[Z2\.  The  desired  result  follows.  In  the  third 
case, 


E[Xt  |  &s\  = 


o°Zs  E\o®^Zt  Zs^  | 
E\zezs\  E[q0^~zs)\ 


using  the  fact  that  E[te^Zt  Zs)  \  ZPS\  —  E[te^Zt  z^]  by  independence. 

Consider  the  special  case  of  Brownian  motion. 

Definition  3.11  A  real- valued  process  B  —  (Bt)t> 0  is  an  (jZ^)- Brownian  motion 
if  B  is  a  Brownian  motion  and  if  B  is  adapted  and  has  independent  increments 
with  respect  to  (J^).  Similarly,  a  process  B  =  {Bt)t> 0  with  values  in  Wl  is  a  d- 
dimensional  (jZ^)- -Brownian  motion  if  B  is  a  d-dimensional  Brownian  motion  and  if 
B  is  adapted  and  has  independent  increments  with  respect  to  (^t) . 

Note  that  if  B  is  a  (d-dimensional)  Brownian  motion  and  is  the  (possibly 
completed)  canonical  filtration  of  B ,  then  B  is  a  (d-dimensional)  (#)5) -Brownian 
motion. 

Let  B  be  an  (jZ^)- -Brownian  motion  started  from  0  (or  from  any  a  G  R).  Then  it 
follows  from  the  above  observations  that  the  processes 
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Q2 

are  martingales  with  continuous  sample  paths.  The  processes  oeBt~^rt  are  called 
exponential  martingales  of  Brownian  motion 
We  can  also  take,  for /  e  L2(M+,  <^(R+),  d t), 

zt  =  [  f(s )  d£, . 

Jo 

Properties  of  Gaussian  white  noise  imply  that  Z  has  independent  increments  with 
respect  to  the  canonical  filtration  of  B ,  and  thus 

f(s)dBs ,  (  J  /(,s)d/iv  j  -  J  f(s)2ds ,  exp  (0  J  f(s)ABs  ~~^J  f(s)2ds) 

are  martingales  (with  respect  to  this  filtration).  One  can  prove  that  these  martingales 
have  a  modification  with  continuous  sample  paths  -  it  is  enough  to  do  it  for  the  first 
one,  and  this  will  follow  from  the  more  general  results  in  Chap.  5  below. 

Finally,  if  Z  =  N  is  a  Poisson  process  with  parameter  A  (and  (jZi)  is  the  canonical 
filtration  of  N ),  it  is  well  known  that  Z  has  independent  increments,  and  we  get  that 

Nt-Xt ,  (N,  -  Xt)2  -  X  t,  exp  (ON,  -  X  t(ee  -  1)) 

are  martingales.  In  contrast  with  the  previous  examples,  these  martingales  do  not 
have  a  modification  with  continuous  sample  paths. 

Proposition  3.12  Let  (Xt)t>o  be  an  adapted  process  and  let  f  :  R  — >  M+  be  a 
convex  function  such  that  E\f(Xt)\  <  oo  for  every  t  >  0. 

(i)  If  (Xt)t> o  is  a  martingale,  then  (f(Xt))t> o  is  a  submartingale. 

(ii)  If  (Xt)t> o  is  a  submartingale,  and  if  in  addition  f  is  nondecreasing,  then 
(f(Xt))t> o  is  a  submartingale. 

Proof  By  Jensen’s  inequality,  we  have,  for  s  <  t, 

E[f(Xt)  I  &s]  >f(E[Xt  |  &s\)  >f(Xs). 

In  the  last  inequality,  we  need  the  fact  that  /  is  nondecreasing  when  (Xt)  is  only  a 
submartingale.  □ 

Consequences  If  (JQ?>o  is  a  martingale,  \Xt\  is  a  submartingale  and  more  gener¬ 
ally,  for  every  p  >  1,  \Xt\p  is  a  submartingale,  provided  that  we  have  ^[IX^]  <  oo 
for  every  t  >  0.  If  (Xt)t>o  is  a  submartingale,  (W)+  =  Xt  v0  is  also  a  submartingale. 

Remark  If  (Xt)t>o  is  any  martingale,  Jensen’s  inequality  shows  that  is  a 

nondecreasing  function  of  t  with  values  in  [0,  oo],  for  every  p  >  1. 
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Proposition  3.13  Let  (Xt)t>o  be  a  submartingale  or  a  supermartingale.  Then,  for 
every  t  >  0, 

sup  £[|Xy|]  <  00. 

0  <s<t 

Proof  It  is  enough  to  treat  the  case  where  (Xt)t>o  is  a  submartingale.  Since  (2Q+  is 
also  a  submartingale,  we  have  for  every  s  G  [0,  t\, 

E[(XS)+]  <  E[(X,)+}. 

On  the  other  hand,  since  X  is  a  submartingale,  we  also  have  for  s  G  [0,  t], 

E[XS]  >  E[X o]. 

By  combining  these  two  bounds,  and  noting  that  \x\  —  2x+  —  x,  we  get 

sup  £[|X,|]  <  2  E[(X,)+]  -  E[X0\  <  00, 

se[0,t] 

giving  the  desired  result.  □ 

The  next  proposition  will  be  very  useful  in  the  study  of  square  integrable 
martingales. 

Proposition  3.14  Let  (Mt)t> o  be  a  square  integrable  martingale  ( that  is,  Mt  G  L2 
for  every  t  >  0).  Let  0  <  s  <  t  and  let  s  =  to  <  t\  <•••<  tp  =  t  be  a  subdivision 
of  the  interval  [s,  t].  Then, 

p 

E[  X]  =  E[Mj  -  M)  |  &s]  =  E[(M,  -  Ms)2  \  &]. 

i=  1 

In  particular, 

p 

e[  -  M,,_,)2]  =  E[M;  -  M2]  =  E[(Mt  -  Ms)2]. 

i=  1 

Proof  For  every  i  —  1 

E[(Mti  -M,_,)2  |  ft]  =  E[E[(MU  -M,,_,)2  |  \  ft] 

=  E[E[Ml  |  ftu_x\  -  2 M,_,  E[Mti  |  ft^l  +  M2_j  |  ft, 

=  e[e[m2.  I  ftti_,]  -  m2_,  I  ft, 

=  E[M2  -  M2_,  I  ft] 


and  the  desired  result  follows  by  summing  over  i. 


□ 
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Our  next  goal  is  to  study  the  regularity  properties  of  sample  paths  of  martingales 
and  supermartingales.  We  first  establish  continuous  time  analogs  of  classical 
inequalities  in  the  discrete  time  setting. 

Proposition  3.15 

(i)  (Maximal  inequality)  Let  (Xt)t>o  be  a  supermartingale  with  right -continuous 
sample  paths.  Then,  for  every  t  >  0  and  every  A  >  0, 


A P(  sup  \XS\  >  A)  <  E[\X 0|]  +  2E[\Xt\}. 

v  0<s<t  7 


(ii)  (Doob’s  inequality  in  Lp)  Let  {Xt)t>o  be  a  martingale  with  right- continuous 
sample  paths.  Then,  for  every  t  >  0  and  every  p  >  \, 


sup  |X,| 

L  ()<s<t 


< 


Note  that  part  (ii)  of  the  proposition  is  useful  only  if  E'lJXfp’]  <  oo. 

Proof 

(i)  Fix  t  >  0  and  consider  a  countable  dense  subset  D  of  M+  such  that  0  e  D 
and  t  e  D.  Then  D  D  [0,  t\  is  the  increasing  union  of  a  sequence  (Dm)m>\  of 
finite  subsets  [0,  t\  of  the  form  Dm  —  {t™ ,  t™, . . . ,  Iff)  where  0  =  <  f[l  < 

•  •  •  <  —  t.  For  every  fixed  m,  we  can  apply  the  discrete  time  maximal 

inequality  (see  Appendix  A2)  to  the  sequence  Yn  —  Xtn/xm ,  which  is  a  discrete 
supermartingale  with  respect  to  the  filtration  %  =  &n a,„-  We  get 

A  p(  sup  |ZS|  >  A)  <  £[|X0|]  +  2E[\X,\\. 

v  seDm  7 


Then,  we  observe  that 


P 


sup 

s€£>n[o,r] 


when  m  f  oo.  We  have  thus 

A  P(  sup  \XS\  >  A)  <  E[\Xo\\  +  2E[\Xt\\. 

yseDH[0,t]  7 

Finally,  the  right-continuity  of  sample  paths  (and  the  fact  that  t  e  D)  ensures 
that 


sup  \XS\  —  sup  \XS 
^€£>n[0,r]  se[0,t] 


(3.1) 


Assertion  (i)  now  follows. 
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(ii)  Following  the  same  strategy  as  in  the  proof  of  (i),  and  using  now  Doob’s 
inequality  in  LP  for  discrete  martingales  (see  Appendix  A2),  we  get,  for  every 
m  >  1, 


sup  \XS 


LseD 


m 


Now  we  just  have  to  let  m  tend  to  infinity,  using  the  monotone  convergence 
theorem  and  then  the  identity  (3.1). 

□ 


Remark  If  we  no  longer  assume  that  the  sample  paths  of  the  supermartingale  X  are 
right-continuous,  the  preceding  proof  shows  that,  for  every  countable  dense  subset 
D  of  R+,  and  every  t  >  0, 

P(  sup  \XS\  >  A)  <  j(E[\X 0|]  +  2E[\Xt\]). 
v^GDn[o,/1]  7 

Letting  A  ->  oo,  we  have  in  particular 

sup  \XS\  <  oo  ,  a.s. 

^eDnfo,/1] 


Upcrossing  numbers  Let /  :  I  — >  R  be  a  function  defined  on  a  subset  I  of  M+ .  If 
a  <b,  the  upcrossing  number  off  along  [a,b\,  denoted  by  M^ab (/),  is  the  maximal 
integer  k  >  1  such  that  there  exists  a  finite  increasing  sequence  s\  <  t\  <  •  •  •  < 
Sk  <  4  of  elements  of  I  such  that  f(si)  <  a  and  f(ti)  >  b  for  every  i  e  {1, . . . ,  k} 
(if,  even  for  k  =  1,  there  is  no  such  subsequence,  we  take  M^ab(f)  =  0,  and  if  such  a 

subsequence  exists  for  every  k  >  1,  we  take  M^ab(J)  —  oo).  Upcrossing  numbers  are 
a  convenient  tool  to  study  the  regularity  of  functions. 

In  the  next  lemma,  the  notation 


means 


lim/00 

^44 


(resp.  lim/O) ) 

sfft 


lim  f(s)  (resp.  lim  f(s)  ). 

We  say  that  g  :  M+  ->  R  is  cadlag  (for  the  French  “continue  a  droite  avec  des 
limites  a  gauche”)  if  g  is  right-continuous  and  has  left-limits  at  every  t  >  0. 

Lemma  3.16  Let  D  be  a  countable  dense  subset  o/M+  and  let  f  be  a  real  function 
defined  on  D.  We  assume  that,  for  every  T  e  D, 

(i)  the  function  f  is  bounded  on  D  D  [0,  T\; 

(ii)  for  all  rationals  a  and  b  such  that  a  <  b, 


Mfab(D  n  [0,  7])  <  00. 
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Then,  the  right-limit 


lim  f(s ) 

exists  for  every  real  t  >  0,  and  similarly  the  left-limit 


fit-)  ■■= 


lira  /(.?) 


exists  for  every  real  t  >  0.  Furthermore,  the  function  g  :  M+  — >  R  defined  by 
g(t)  =f(t~h)  is  cadlag. 

We  omit  the  proof  of  this  analytic  lemma.  It  is  important  to  note  that  the  right  and 
left-limits  f(t+)  and  f(t—)  are  defined  for  every  t  >  0  (t  >  0  in  the  case  of  fit—)) 
and  not  only  for  t  e  D. 

Theorem  3.17  Let  (Xt)t>o  be  a  supermartingale,  and  let  D  be  a  countable  dense 
subset  of  R_|_. 

(i)  For  almost  every  oo  G  Q,  the  restriction  of  the  function  s  i->  Xs(oo )  to  the  set  D 
has  a  right-limit 


Xt+(co)  :=  lim  Xs(oo)  (3.2) 

at  every  t  e  [0,  oo),  and  a  left-limit 


Xt-(co)  lim  Xs(co) 


at  every  t  e  (0,  oo). 

(ii)  For  every  t  G  R+,  G  L1  and 


xt  > 


w/t/z  equality  if  the  function  t  — >  E[Xt]  is  right- continuous  (in  particular  ifX 
is  a  martingale).  The  process  (Xr+)r>o  is  a  supermartingale  with  respect  to  the 
filtration  (J^+).  It  is  a  martingale  ifX  is  a  martingale. 

Remark  For  the  last  assertions  of  (ii),  we  need  Xt+  (co)  to  be  defined  for  every 
co  G  Q  and  not  only  outside  a  negligible  set.  As  we  will  see  in  the  proof,  we  can 
just  take  (oo)  —  0  when  the  limit  in  (3.2)  does  not  exist. 

Proof 

(i)  Fix  T  gD.  By  the  remark  following  Proposition  3.15,  we  have 

sup  \XS\  <  oo  ,  a.s. 
sGDn[o,r] 


56 


3  Filtrations  and  Martingales 


As  in  the  proof  of  Proposition  3.15,  we  can  choose  a  sequence  (Dm)m>\  of 
finite  subsets  of  D  that  increase  to  D  D  [0,  T\  and  are  such  that  0 ,T  e  Dm. 
Doob’s  upcrossing  inequality  for  discrete  supermartingales  (see  Appendix  A2) 
gives,  for  every  a  <  b  and  every  m>  1 , 

E[Mxah(Dm)]  <  -}—E[(XT  -  a)-]. 

b  —  a 

We  let  vn  — >  oo  and  get  by  monotone  convergence 

E[Mxab(D  n  [0,  7])]  <  t^—E[{Xt  -  a)~]  <  OO. 

b  —  a 

We  thus  have 


M*b([ 0,  T]  fl  D)  <  oo  ,  a.s. 


Set 


U  ({  SUP  IV|  =  ooj  (J  (  U  {Mab(E>  n  [0,  T]) 

TeD  teD(l[0,T]  a,beO,a<b 


=  OO 


Then  P(N)  —  0  by  the  preceding  considerations.  On  the  other  hand,  if  co  ^  N, 
the  function  D  3  1 1->  Xt(co)  satisfies  all  assumptions  of  Lemma  3.16.  Assertion 
(i)  now  follows  from  this  lemma. 

(ii)  To  define  Xr+  (co)  for  every  co  £  Q  and  not  only  on  £2\N,  we  set 


Xt+{a>)  = 


lim  Xs(co)  if  the  limit  exists 

s-l-lt,s€D 

0  otherwise. 


With  this  definition,  is  -measurable. 

Fix  t  >  0  and  choose  a  sequence  (tn)n> o  in  D  such  that  tn  decreases  strictly 
to  t  as  n  ->  oo.  Then,  by  construction,  we  have  a.s. 


Xt+  =  lim  Xtn . 

n->o o 

Set  Yk  =  Xt_k  for  every  integer  k  <  0.  Then  Y  is  a  backward  super¬ 
martingale  with  respect  to  the  (backward)  discrete  filtration  =  YXt_k  (see 
Appendix  A2).  From  Proposition  3.13,  we  have  sup^<0JE[|F^|]  <  oo.  The 
convergence  theorem  for  backward  supermartingales  (see  Appendix  A2)  then 
implies  that  the  sequence  Xtn  converges  to  Xt+  in  L1 .  In  particular,  Xt+  e  Ll . 

Thanks  to  the  L1 -convergence,  we  can  pass  to  the  limit  n  oo  in  the 
inequality  Xt  >  E[Xtn  \  and  we  get 


Xt  >  E[Xt+  |  &t\ 
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(we  use  the  fact  that  the  conditional  expectation  is  continuous  for  the  L1  -norm, 
and  it  is  important  to  realize  that  an  a.s.  convergence  would  not  be  sufficient 
to  warrant  this  passage  to  the  limit).  Furthermore,  thanks  again  to  the  L1- 
convergence,  we  have  E[Xt+\  =  \imE[Xtn].  Thus,  if  the  function  s  — >  E[XS\ 
is  right-continuous,  we  must  have  E[Xt]  —  E[Xt+\  =  E[E[Xt+  |  #)]],  and  the 
inequality  Xt  >  E[Xt+  \  &t\  then  forces  Xt  —  E[Xt+  \  TPt\. 

We  already  noticed  that  Xf+  is  -measurable.  Let  s  <  t  and  let  (sn)n> o 
be  a  sequence  in  D  that  decreases  strictly  to  s.  We  may  assume  that  sn  <  tn  for 
every  n.  Then  as  previously  XSn  converges  to  Xs+  in  L1,  and  thus,  if  A  e 
which  implies  A  e  &Sn  for  every  n ,  we  have 

E[XS+  1A]  =  lim  E[XSn  1A\  >  lim  E[XtnlA]  =E[Xt+lA\  =  E[E[Xt+  \  ^S+]1A\. 

n—>oo  «— >oo 


Since  this  inequality  holds  for  every  A  e  and  since  Xs+  and  E[Xt+  \  &S+] 
are  both  -measurable,  it  follows  that  Xs+  >  |  &s+\  •  Finally,  if 

X  is  a  martingale,  inequalities  can  be  replaced  by  equalities  in  the  previous 
considerations. 

□ 

Theorem  3.18  Assume  that  the  filtration  {AXt)  is  right- continuous  and  complete. 
Let  X  —  (Xt)t> o  he  a  supermartingale,  such  that  the  function  t  — >  E[Xt\  is  right- 
continuous.  Then  X  has  a  modification  with  cadlag  sample  paths,  which  is  also  an 
])-  supermartingale. 

Proof  Let  D  be  a  countable  dense  subset  of  M+  as  in  Theorem  3.17.  Let  N  be  the 
negligible  set  defined  in  (3.3).  We  set,  for  every  t  >  0, 


Yt(oj)  = 


Xt+  (a;)  if  co  N 
0  if  coeN. 


Lemma  3.16  then  shows  that  the  sample  paths  of  Y  are  cadlag. 

The  random  variable  is  -measurable,  and  thus  ^-measurable  since 
the  filtration  is  right-continuous.  As  the  negligible  set  N  belongs  to  J^oo,  the 
completeness  of  the  filtration  ensures  that  Yt  is  measurable.  By  Theorem  3.17 
(ii),  we  have  for  every  t  >  0, 


=  E[Xt+ 


&]  =  Xt+  =  Yu 


a.s. 


because  Xr+  is  ^-measurable.  Consequently,  Y  is  a  modification  of  X.  The  process 
Y  is  adapted  to  the  filtration  (J^i).  Since  Y  is  a  modification  of  X  the  inequality 
E[Xt  |  &s]  <  Xs,  for  0  <  ^  <  t,  implies  that  the  same  inequality  holds  for  Y.  □ 

Remarks 

(i)  Let  us  comment  on  the  assumptions  of  the  theorem.  A  simple  example  shows 
that  our  assumption  that  the  filtration  is  right-continuous  is  necessary.  Take 
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Q  —  {—1,1},  with  the  probability  measure  P  defined  by  P({1})  =  P({— 1})  = 
1/2.  Let  e  be  the  random  variable  £(co)  —  co ,  and  let  the  process  (Xt)t>o  be 
defined  by  Xt  =  0  if  0  <  t  <  1 ,  and  Xt  —  £  if  t  >  1 .  Then  it  is  easy  to  verify 
that  X  is  a  martingale  with  respect  to  its  canonical  filtration  (J^x)  (which  is 
complete  since  there  are  no  nonempty  negligible  sets!).  On  the  other  hand,  no 
modification  of  X  can  be  right-continuous  at  t  —  1 .  This  does  not  contradict  the 
theorem  since  the  filtration  is  not  right-continuous  (^f+  ^  J^~f). 

(ii)  Similarly,  to  show  that  the  right-continuity  of  the  mapping  t  — >  E[Xt\  is 
needed,  we  can  just  take  Xt  —  f(t ),  where/  is  any  nonincreasing  deterministic 
function.  If  /  is  not  right-continuous,  no  modification  of  X  can  have  right- 
continuous  sample  paths. 


3.4  Optional  Stopping  Theorems 

We  start  with  a  convergence  theorem  for  supermartingales. 

Theorem  3.19  Let  X  be  a  supermartingale  with  right -continuous  sample  paths. 
Assume  that  the  collection  {Xt)t> o  is  bounded  in  Ll.  Then  there  exists  a  random 
variable  X ^  e  Ll  such  that 


lim  Xt  —  Xoq,  a.s. 

t—^oo 

Proof  Let  D  be  a  countable  dense  subset  of  R+.  From  the  proof  of  Theorem  3.17, 
we  have,  for  every  T  e  D  and  a  <  b, 

E[Mxab(D  n  [0,  7-])]  <  -L E[(XT  -  a)-]. 

b  —  a 

By  monotone  convergence,  we  get,  for  every  a  <  b, 

E[Mxb(D)}  <  -2—  sup  E[(Xt-a)~]  <  oo, 

b  —  a  t>  o 

since  the  collection  (Xr)/>o  is  bounded  in  L1.  Hence,  a.s.  for  all  rationals  a  <  b,  we 
have  M*b(D)  <  oo.  This  implies  that  the  limit 

Xoo  :=  lim  X,  (3.4) 

D  3 1 — >-  oo 

exists  a.s.  in  [—00,00].  We  can  in  fact  exclude  the  values  +00  and  —00,  since 
Fatou’s  lemma  gives 


E[\XooW  <  lim  inf  £[|X?|]  <  00, 
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and  we  get  that  X ^  e  Ll .  The  right-continuity  of  sample  paths  (which  we  have  not 
yet  used)  allows  us  to  remove  the  restriction  t  e  Din  the  limit  (3.4).  □ 

Under  the  assumptions  of  Theorem  3.19,  the  convergence  of  Xt  towards  X^  may 
not  hold  in  Ll .  The  next  result  gives,  in  the  case  of  a  martingale,  necessary  and 
sufficient  conditions  for  the  convergence  to  also  hold  in  L1 . 

Definition  3.20  A  martingale  (Xr)r>o  is  said  to  be  closed  if  there  exists  a  random 
variable  Z  e  L1  such  that,  for  every  t  >  0, 

X,  =  E[Z  |  ^]. 


Theorem  3.21  Let  X  be  a  martingale  with  right- continuous  sample  paths.  Then  the 
following  properties  are  equivalent: 


(i)  X  is  closed; 

(ii)  the  collection  (Xt)t> o  is  uniformly  integrable; 

(iii)  Xt  converges  a.s.  and  in  Ll  as  t  —>  oo. 


Moreover,  if  these  properties  hold,  we  haveXt  —  E[X ^ 
Xoo  g  L1  is  the  a.s.  limit  ofXt  as  t  —>  oo. 


&t\  for  every  t  >  0,  where 


Proof  The  fact  that  (i)=4>(ii)  is  easy:  If  Z  G  L1,  the  collection  of  all  random  variables 
E[Z  |  £f],  when  Z?  varies  over  sub-cr-fields  of  is  uniformly  integrable.  If  (ii) 
holds,  in  particular  the  collection  (Xf)f>o  is  bounded  in  L1  and  Proposition  3.19 
implies  that  Xt  converges  a.s.  to  Xoo.  By  uniform  integrability,  the  latter  convergence 
also  holds  in  L1 .  Finally,  if  (iii)  holds,  for  every  s  >  0,  we  can  pass  to  the  limit  t  -> 
oo  in  the  equality  Xs  —  E[Xt  \  &s]  (using  the  fact  that  the  conditional  expectation  is 
continuous  for  the  Z^-norm),  and  we  get  Xs  —  E[X0 Q  \  □ 

We  will  now  use  the  optional  stopping  theorems  for  discrete  martingales  and 
supermartingales  in  order  to  establish  similar  results  in  the  continuous  time  setting. 
Let  (X?)r>0  be  a  martingale  or  a  supermartingale  with  right-continuous  sample  paths, 
and  such  that  Xt  converges  a.s.  as  t  ->  oo  to  a  random  variable  denoted  by  X^.  Then, 
for  every  stopping  time  T,  we  write  XT  for  the  random  variable 


Xt(co)  —  1  {T(oo)<oo}Xt(m)(oj)  +  l{T(oo)  =  oo}Xoo  (co). 

Compare  with  Theorem  3.7,  where  the  random  variable  XT  was  only  defined  on 
the  subset  {T  <  oo}  of  Q.  With  this  definition,  the  random  variable  XT  is  still 
-measurable:  Use  Theorem  3.7  and  the  easily  verified  fact  that  l{r=oo}^oo  is 
-measurable. 

Theorem  3.22  (Optional  stopping  theorem  for  martingales)  Let  (Xt)t>o  be  a 
uniformly  integrable  martingale  with  right -continuous  sample  paths.  Let  S  and  T  be 
two  stopping  times  with  S  <  T.  Then  Xs  and  Xt  are  in  Ll  and 


Xs  —  E[Xt  |  ^s\- 
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In  particular,  for  every  stopping  time  S,  we  have 


Xs  =  E[X oo 


and 


E[XS]  =  E[X  oo]  =  E[X o]. 


Proof  Set,  for  every  integer  n  >  0, 


k=0 


l{k2-n<T<(k+l)2~n}  +  00  ‘  l{r=oo} 


and  similarly 


l{k2~n<S<(k+l)2~n}  +  OO  •  1(5=00}- 


By  Proposition  3.8,  (Tn)  and  (Sn)  are  two  sequences  of  stopping  times  that  decrease 
respectively  to  T  and  to  S.  Moreover,  we  have  Sn  <  Tn  for  every  n  >  0. 

Now  observe  that,  for  every  fixed  n ,  2 nSn  and  2 nTn  are  stopping  times  of  the 

discrete  filtration  :=  f^k/2n,  and  is  a  discrete  martingale 

with  respect  to  this  filtration.  From  the  optional  stopping  theorem  for  uniformly 
integrable  discrete  martingales  (see  Appendix  A2)  we  get  that  Y$s  and  Y^T  are  in 
Ll,  and 


y(n ) 

2  2  nSn 


=  E[Xt„  | 


(here  we  need  to  verify  that  but  this  is  straightforward). 

Let  A  G  #5.  Since  ^5  c  ^sn,  we  have  A  G  ^sn  and  thus 


By  the  right-continuity  of  sample  paths,  we  get  a.s. 

Xs  =  lim  XSn  ,  XT  =  lim  XTn . 

h— >oo  n— >00 

These  limits  also  hold  in  L1 .  Indeed,  thanks  again  to  the  optional  stopping  theorem 
for  uniformly  integrable  discrete  martingales,  we  have  Xsn  —  E\X ^  \  &sn]  f°r  every 
n ,  and  thus  the  sequence  (Xsn)  is  uniformly  integrable  (and  the  same  holds  for  the 
sequence  (XTn)). 
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The  L1  -convergence  implies  that  Xs  and  XT  belong  to  L1 ,  and  also  allows  us  to 
pass  to  the  limit  n  ->  oo  in  the  equality  E[lAXsn]  —  E[lAXTn\  in  order  to  get 


E[1AXS\  =E[1aXt\. 


Since  this  holds  for  every  A  e  ffis,  and  since  the  variable  Xs  is  ^-measurable  (by 
the  remarks  before  the  theorem),  we  conclude  that 


X*  =  E[Xt  I  &s\ 


which  completes  the  proof.  □ 

We  now  give  two  corollaries  of  Theorem  3.22. 

Corollary  3.23  Let  (Xt)t>o  be  a  martingale  with  right -continuous  sample  paths, 
and  let  S  <  T  be  two  bounded  stopping  times.  Then  Xs  and  Xj  are  in  Ll  and 


Xs  —  E[Xt  |  &s\  • 


Proof  Let  a  >  0  such  that  S  <  T  <  a.  We  apply  Theorem  3.22  to  the  martingale 
(XtAa)t>o  which  is  closed  by  Xa.  □ 

The  second  corollary  shows  that  a  martingale  (resp.  a  uniformly  integrable 
martingale)  stopped  at  an  arbitrary  stopping  time  remains  a  martingale  (resp.  a 
uniformly  integrable  martingale).  This  result  will  play  an  important  role  in  the  next 
chapters. 

Corollary  3.24  Let  (Xt)t>o  be  a  martingale  with  right -continuous  sample  paths, 
and  let  T  be  a  stopping  time. 

(i)  The  process  (XtAr)t> o  is  still  a  martingale. 

(ii)  Suppose  in  addition  that  the  martingale  (Xt)t>o  is  uniformly  integrable.  Then 
the  process  ( XtAT)t>o  is  also  a  uniformly  integrable  martingale,  and  more 
precisely  we  have  for  every  t  >  0, 

XtAT  =  E[XT  \  &t\.  (3.5) 

Proof  We  start  with  the  proof  of  (ii).  Note  that  t  A  T  is  a  stopping  time  by  property 
(f)  of  stopping  times.  By  Theorem  3.22,  XtAj  and  XT  are  in  L1,  and  we  also  know 
that  WA;r  is  J^Ar-measurable,  hence  ^-measurable  since  ^at  C  ^t.  So  in  order 
to  get  (3.5),  it  is  enough  to  prove  that,  for  every  A  e  TXt, 


E[1aXt\  =  E[lAXtAT]. 


Let  us  fix  A  e  &t.  First,  we  have  trivially 


E[lAn{T<t}  Xt\  —  E[  lAn{T<t}XtAr\- 


(3.6) 
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On  the  other  hand,  by  Theorem  3.22,  we  have 

XtAT  —  E[Xt  I 

and  we  notice  that  we  have  both  A  n  {T  >  t}  e  XX t  and  A  D  {T  >  t}  e  XX j  (the  latter 
as  a  straightforward  consequence  of  the  definition  of  so  that  A  Pi  {T  >  t}  e 
XX t  fl  XX j  —  XXtAT-  Using  the  preceding  display,  we  obtain 

E[lAn{T>t}  Xt\  =  E[lAn{T>t}XtAT]. 

By  adding  this  equality  to  (3.6),  we  get  the  desired  result. 

To  prove  (i),  we  just  need  to  apply  (ii)  to  the  (uniformly  integrable)  martingale 
(XtAa)a>o,  for  any  choice  of  a  >  0.  □ 

Applications  Above  all,  the  optional  stopping  theorem  is  a  powerful  tool  for 
explicit  calculations  of  probability  distributions.  Let  us  give  a  few  important  and 
typical  examples  of  such  applications  (several  other  examples  can  be  found  in  the 
exercises  of  this  and  the  following  chapters).  Let  B  be  a  real  Brownian  motion  started 
from  0.  We  know  that  B  is  a  martingale  with  continuous  sample  paths  with  respect 
to  its  canonical  filtration.  For  every  real  a ,  set  Ta  —  inf{t  >  0  :  Bt  —  a).  Recall  that 
Ta  <  oo  a.s. 

(a)  Law  of  the  exit  point  from  an  interval.  For  every  a  <  0  <  b,  we  have 

P(Ta  <  Tb)  =  ,  P(Tb  <  Ta)  = 

b  —  a  b  —  a 

To  get  this  result,  consider  the  stopping  time  T  —  Ta  A  Tb  and  the  stopped 
martingale  Mt  —  Btat  (this  is  a  martingale  by  Corollary  3.24).  Then,  \M\  is 
bounded  above  by  b  v  \a\,  and  the  martingale  M  is  thus  uniformly  integrable. 
We  can  apply  Theorem  3.22  and  we  get 

0  =  E[M0\  =  E[Mt\  =  bP(Tb  <  Ta)  +  aP(Ta  <  Tb). 


Since  we  also  have  P(Tb  <  Ta)  +P(Ta  <  Tb )  =  1,  the  desired  result  follows.  In 
fact  the  proof  shows  that  the  result  remains  valid  if  we  replace  Brownian  motion 
by  a  martingale  with  continuous  sample  paths  and  initial  value  0,  provided  we 
know  that  this  process  exits  (a,  b)  a.s. 

(b)  First  moment  of  exit  times.  For  every  a  >  0,  consider  the  stopping  time  Ua  — 
inf{t  >  0  :  \Bt\  —  a).  Then 


E[Ua]  =  a2. 

To  verify  this,  consider  the  martingale  Mt  —  Bj  —  t.  By  Corollary  3.24, 
MtAUa  is  still  a  martingale,  and  therefore  E[MtAUa\  —  E[Mo]  —  0,  giving 
E[(B,  AUa)2]  =  E[t  A  Ua\.  Then,  on  one  hand,  E[t  A  Ua\  converges  to  E[Ua\  as 
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t  ->  oo  by  monotone  convergence,  on  the  other  hand,  E[(BtAUJ2]  converges  to 
E[{BUa )2]  =  a2  as£  ->  oo,  by  dominated  convergence  (note  that  (BtAUa)2  <  a2). 
The  stated  result  follows.  We  may  observe  that  we  have  E[Ua\  <  oo  in  contrast 
with  the  property  E[Ta\  —  oo,  which  was  noticed  in  Chap.  2. 

(c)  Laplace  transform  of  hitting  times.  We  now  fix  a  >  0  and  our  goal  is  to  compute 
the  Laplace  transform  of  Ta.  For  every  A  e  R,  we  can  consider  the  exponential 
martingale 


A2 

—  exp  (A  Bt - t). 

Suppose  first  that  A  >  0.  By  Corollary  3.24,  the  stopped  process  N^aT  is  still 
a  martingale,  and  we  immediately  see  that  this  martingale  is  bounded  above  by 
eAfl,  hence  uniformly  integrable.  By  applying  the  last  assertion  of  Theorem  3.22 
to  this  martingale  and  to  the  stopping  time  S  —  Ta  (or  to  S  —  oo)  we  get 

eAa£[e  2  r“]  =  E[N$a\  =  £[A^]  =  1  . 

Replacing  A  by  V2A,  we  conclude  that,  for  every  A  >  0, 

E[e~XT“]  =  e-"^.  (3.7) 


(This  formula  could  also  be  deduced  from  the  knowledge  of  the  density  of  Ta , 
see  Corollary  2.22.)  As  an  instructive  example,  one  may  try  to  reproduce  the 
preceding  line  of  reasoning,  using  now  the  martingale  N2  for  A  <  0:  one  gets 
an  absurd  result,  which  can  be  explained  by  the  fact  that  the  stopped  martingale 
N2aT  is  not  uniformly  integrable  when  A  <  0.  When  applying  Theorem  3.22, 
it  is  crucial  to  always  verify  the  uniform  integrability  of  the  martingale.  In  most 
cases,  this  is  done  by  verifying  that  the  (stopped)  martingale  is  bounded. 

(d)  Laplace  transform  of  exit  times  from  an  interval.  With  the  notation  of  (b),  we 
have  for  every  a  >  0  and  every  A  >  0, 


£[exp(-At/a)] 


cosh(aV2A) 


To  see  this,  first  note  that  Ua  and  BUa  are  independent  since,  using  the  symmetry 
property  of  Brownian  motion, 


E[l{BUa=a)  exp(— Al/a)]  =  E[ l{BUa=-a}  exp(-A  Ua)\  =  -  £[exp(-A [/„)]. 

Then  the  claimed  formula  is  proved  by  the  same  method  as  in  (c),  writing 
E[Ny  ]  —  =  1  and  noting  that  the  application  of  the  optional  stopping 
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theorem  is  justified  by  the  fact  that  N^aU  is  bounded  above  by  eAa.  See  also 
Exercise  3.27  for  a  more  general  formula. 

We  end  this  chapter  with  the  optional  stopping  theorem  for  nonnegative  super¬ 
martingales.  This  result  will  be  useful  in  later  applications  to  Markov  processes. 
We  first  note  that,  if  (Zt)t> o  is  a  nonnegative  supermartingale  with  right-continuous 
sample  paths,  (Zt)t>0  is  automatically  bounded  in  L1  since  E[Zt]  <  E[Zq\,  and  by 
Theorem  3.19,  Zt  converges  a.s.  to  a  random  variable  e  Ll  as  t  ->  oo.  As 
explained  before  Theorem  3.22,  we  can  thus  make  sense  of  Zj  for  any  (finite  or  not) 
stopping  time  T. 

Theorem  3.25  Let  (Zt)t> o  be  a  nonnegative  supermartingale  with  right -continuous 
sample  paths.  Let  U  and  V  be  two  stopping  times  such  that  U  <  V.  Then,  Zjj  and 
Zy  are  in  Ll,  and 


Zu  >  E[Zy  |  TZjj\- 


Remark  This  implies  that  E[Zjj\  >  E[ZV\ ,  and  since  Zu  —  Zy  —  Z on  the  event 
{U  =  oo},  it  also  follows  that 


E[l{u<oo}Zu\  >  E[l{f/<oo}  Zy]  >  E[l{y<00}Zy]. 

Proof  In  the  first  step  of  the  proof,  we  make  the  extra  assumption  that  U  and  V  are 
bounded  and  we  verify  that  we  then  have  E[Zu\  >  E[ZV\.  Let  p  >  1  be  an  integer 
such  that  U  <  p  and  V  <  p.  For  every  integer  n  >  0,  set 


p2n- 1  ,  ,  , 

Eft  T-  1 

k=0 


p2n~l  ,  1 

Eft  H-  1 

1(^2— «<V<(^+1)2— 

k=0 


in  such  a  way  (by  Proposition  3.8)  that  (Un)  and  (Vn)  are  two  sequences  of  bounded 
stopping  times  that  decrease  respectively  to  U  and  V,  and  additionally  we  have  Un  < 
for  every  «  >  0.  The  right-continuity  of  sample  paths  ensures  that  ZUn  — >  Zv 
and  ZVn  — >  Zy  a.s.  as  n  ->  oo.  Then,  by  the  optional  stopping  theorem  for  discrete 
supermartingales  in  the  case  of  bounded  stopping  times  (see  Appendix  A2),  with 
respect  to  the  filtration  (J^/2«+i)&>o,  we  have  for  every  n  >  0, 

ZC/„+!  >  £[ZC/„  I  ^„+J. 

Setting  Yn  —  Zu_n  and  =  ZPu-n,  for  every  integer  n  <  0,  we  get  that 
the  sequence  (Yn)n< o  is  a  backward  supermartingale  with  respect  to  the  filtration 
(Ztfn)n< o.  Since,  for  every  ft  >  0,  E[ZUn\  <  E[Zq\  (by  another  application  of  the 
discrete  optional  stopping  theorem),  the  sequence  (Yn)n< 0  is  bounded  in  L1 ,  and 
by  the  convergence  theorem  for  backward  supermartingales  (see  Appendix  A2),  it 
converges  in  L1 .  Hence  the  convergence  of  ZUn  to  Zv  also  holds  in  L1  and  similarly 
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the  convergence  of  Zyn  to  Zy  holds  in  L1 .  Since  Un  <  Vn,  by  yet  another  application 
of  the  discrete  optional  stopping  theorem,  we  have  E[ZUn\  >  E[ZV n].  Using  the 
L1 -convergence  of  ZUn  and  ZVn  we  can  pass  to  the  limit  n  ->  oo  and  obtain  that 
E[Zjj\  >  E[ZV\  as  claimed. 

Let  us  prove  the  statement  of  the  theorem  (no  longer  assuming  that  U  and  V  are 
bounded).  By  the  first  step  of  the  proof  applied  to  the  stopping  times  0  and  U  Ap,  we 
have  E[ZUAp\  <  E[Zf\  for  every  p  >  1,  and  Fatou’s  lemma  gives  E[ZV\  <  £[Zo]  < 
oo  and  similarly  E[Zy]  <  oo.  Fix  A  e  AZ u  C  J^y  and  recall  our  notation  UA  for 
the  stopping  time  defined  by  UA(oo)  —  U(oo)  if  co  e  A  and  UA(oo)  —  oo  otherwise 
(cf.  property  (d)  of  stopping  times).  By  the  first  part  of  the  proof,  we  have,  for  every 
P>  1, 


E[ZuAAp\  —  E \ZyA  A/; ]  • 

By  writing  each  of  these  two  expectations  as  a  sum  of  expectations  over  the  sets  Ac, 
An{U  <  p}  and  A  D  {t/  >  p },  and  noting  that  U  >  p  implies  V  >  p,  we  get 

E[Zjj  1ad{u<p}\  —  E[ZV  1  An{u<p}\- 
Letting  p  — >  oo  then  gives 

^[Z?7  lAD{f/<oo}]  >E[ZV 

lAn{£/<oo}]- 

On  the  other  hand,  the  equality  E[Zjj  lAn{f/=oo}]  —  E[Zy  lAn{t/=oo}]  is  trivial  and 
by  adding  it  to  the  preceding  display,  we  obtain 

E[ZV  1a]  >  E[ZV  1A]  =  E[E[ZV  \  &v\  1A]. 

Since  this  holds  for  every  A  e  AZu  and  Zjj  is  ZPjj -measurable,  the  desired  result 
follows.  □ 


Exercises 

In  the  following  exercises,  processes  are  defined  on  a  probability  space  (X2,  P) 
equipped  with  a  complete  filtration  (f^t)te[  o,oo]  • 

Exercise  3.26 

1.  Let  M  be  a  martingale  with  continuous  sample  paths  such  that  M0  =  v  e  R+. 
We  assume  that  Mt  >  0  for  every  t  >  0,  and  that  Mt  — >  0  when  t  ->  oo,  a.s. 
Show  that,  for  every  y  >  x, 
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2.  Give  the  law  of 


sup  Bt 

t<T0 


when  B  is  a  Brownian  motion  started  from  v  >  0  and  To  =  inf{t  >  0  :  Bt  —  0}. 
3.  Assume  now  that  B  is  a  Brownian  motion  started  from  0,  and  let  /x  >  0.  Using 
an  appropriate  exponential  martingale,  show  that 

sup(Bt  —  fit) 

t>  o 

is  exponentially  distributed  with  parameter  2/x. 

Exercise  3.27  Let  B  be  an  (J^) -Brownian  motion  started  from  0.  Recall  the 
notation  Tx  —  inf{t  >  0  :  Bt  —  x},  for  every  v  e  R.  We  fix  two  real  numbers 
a  and  b  with  a  <  0  <  b,  and  we  set 

T  —  Ta  ATb  . 


1.  Show  that,  for  every  A  >  0, 


Ts[exp(— XT)]  — 


cosh(^y^  V2A) 
cosh(^  V2A) 


(Hint:  One  may  consider  a  martingale  of  the  form 


Mt  —  exp 


a)  —  X?j  +  exp  ^  —  V2A (Bt  —  a)  — 


with  a  suitable  choice  of  a.) 

2.  Show  similarly  that,  for  every  A  >  0, 


£[exp(- AL)  l{T=Ta}\ 


sinh(/?  V2A) 
sinh((/?  —  a)  V2X) 


3.  Recover  the  formula  for  P(Ta  <  T/})  from  question  (2). 

Exercise  3.28  Let  (Bt)t> o  be  an  (^)-Brownian  motion  started  from  0.  Let  a  >  0 
and 


cra  —  inf {t  >  0  :  Bt  <  t  —  a}. 


1.  Show  that  oa  is  a  stopping  time  and  that  aa  <  ooa.s. 
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2.  Using  an  appropriate  exponential  martingale,  show  that,  for  every  A  >  0, 

£[exp(— Xoa)\  —  exp(— a(y/ 1  +  2A  —  1)). 

The  fact  that  this  formula  remains  valid  for  A  e  [—  |,  0[  can  be  obtained  via  an 
argument  of  analytic  continuation. 

2 

3.  Let  fi  e  R  and  Mt  —  exp (ptBt  —  ^t).  Show  that  the  stopped  martingale  M0aAt 
is  closed  if  and  only  if  /x  <  1 .  (Hint:  This  martingale  is  closed  if  and  only  if 

E\M°a\  =  1) 

Exercise  3.29  Let  (U)r>o  be  a  uniformly  integrable  martingale  with  continuous 
sample  paths,  such  that  To  =  0.  We  set  Y ^  —  lim^oo  Yt.  Let  p  >  1  be  a  fixed 
real  number.  We  say  that  Property  (P)  holds  for  the  martingale  T  if  there  exists  a 
constant  C  such  that,  for  every  stopping  time  T,  we  have 

E[|Too  —  Yt\p  |  &t\  <  C. 

1.  Show  that  Property  (P)  holds  for  T  if  Y0 0  is  bounded. 

2.  Let  B  be  an  (#))- Brownian  motion  started  from  0.  Show  that  Property  (P)  holds 
for  the  martingale  Yt  —  BtA\.  (Hint:  One  may  observe  that  the  random  variable 
supr<1  \Bt\  is  in  LP .) 

3.  Show  that  Property  (P)  holds  for  T,  with  the  constant  C,  if  and  only  if,  for  any 
stopping  time  T, 


E[\  Yt  —  Tqo  \p]  <  C  P[T  <  oo] . 

(Hint:  It  may  be  useful  to  consider  the  stopping  times  TA  defined  for  A  e  AP T  in 
property  (d)  of  stopping  times.) 

4.  We  assume  that  Property  (P)  holds  for  T  with  the  constant  C.  Let  S  be  a 
stopping  time  and  let  Ys  be  the  stopped  martingale  defined  by  T^  =  YtAS  (see 
Corollary  3.24).  Show  that  Property  (P)  holds  for  T5  with  the  same  constant 
C.  One  may  start  by  observing  that,  if  S  and  T  are  stopping  times,  one  has 

yst  -  ySa t  -  YTS  =  E[YT  I  jy. 

5.  We  assume  in  this  question  and  the  next  one  that  Property  (P)  holds  for  T  with 
the  constant  C  =  1 .  Let  a  >  0,  and  let  (Rn)n> o  be  the  sequence  of  stopping  times 
defined  by  induction  by 


Rn+ 1  =  inf{r  >  Rn 


Y,-Yr 


n 


(inf0  =  oo). 


Show  that,  for  every  integer  n  >  0, 


ap  P(Rn+ 1  <  oo)  <  P(Rn  <  oo). 
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6.  Infer  that,  for  every  v  >  0, 


P 


( 


sup  Yt  >  x 

t>  0 


<  2P  2~px I1 


Notes  and  Comments 

This  chapter  gives  a  brief  presentation  of  the  so-called  general  theory  of  processes. 
We  limited  ourselves  to  the  notions  that  are  needed  in  the  remaining  part  of 
this  book,  but  the  interested  reader  can  consult  the  treatise  of  Dellacherie  and 
Meyer  [13,  14]  for  more  about  this  subject.  Most  of  the  martingale  theory  presented 
in  Sections  3  and  4  goes  back  to  Doob  [15].  A  comprehensive  study  of  the  theory  of 
continuous  time  martingales  can  be  found  in  [14].  The  applications  of  the  optional 
stopping  theorem  to  Brownian  motion  are  very  classical.  For  other  applications  in 
the  same  vein,  see  in  particular  the  book  [70]  by  Revuz  and  Yor.  Exercise  3.29  is 
taken  from  the  theory  of  BMO  martingales,  see  e.g.  [18,  Chapter  7]. 


Chapter  4 

Continuous  Semimartingales 


Continuous  semimartingales  provide  the  general  class  of  processes  with  continuous 
sample  paths  for  which  we  will  develop  the  theory  of  stochastic  integration  in 
the  next  chapter.  By  definition,  a  continuous  semimartingale  is  the  sum  of  a 
continuous  local  martingale  and  a  (continuous)  finite  variation  process.  In  the 
present  chapter,  we  study  separately  these  two  classes  of  processes.  We  start 
with  some  preliminaries  about  deterministic  functions  with  finite  variation,  before 
considering  the  corresponding  random  processes.  We  then  define  (continuous)  local 
martingales  and  we  construct  the  quadratic  variation  of  a  local  martingale,  which 
will  play  a  fundamental  role  in  the  construction  of  stochastic  integrals.  We  explain 
how  properties  of  a  local  martingale  are  related  to  those  of  its  quadratic  variation. 
Finally,  we  introduce  continuous  semimartingales  and  their  quadratic  variation 
processes. 


4.1  Finite  Variation  Processes 

In  this  chapter,  all  processes  are  indexed  by  M+  and  take  real  values.  The  first  section 
provides  a  brief  presentation  of  finite  variation  processes.  We  start  by  discussing 
functions  with  finite  variation  in  a  deterministic  setting. 


4.1.1  Functions  with  Finite  Variation 

In  our  discussion  of  functions  with  finite  variation,  we  restrict  our  attention  to 
continuous  functions,  as  this  is  the  case  of  interest  in  the  subsequent  developments. 
Recall  that  a  signed  measure  on  a  compact  interval  [0,  T]  is  the  difference  of  two 
finite  positive  measures  on  [0,  T\. 


©  Springer  International  Publishing  Switzerland  2016 

J.-F.  Le  Gall,  Brownian  Motion,  Martingales,  and  Stochastic  Calculus , 

Graduate  Texts  in  Mathematics  274,  DOI  10. 1007/978-3-3 19-3 1089-3_4 


69 


70 


4  Continuous  Semimartingales 


Definition  4.1  Let  T  >  0.  A  continuous  function  a  :  [0,  T]  — >  R  such  that  a(0)  —  0 
is  said  to  ha finite  variation  if  there  exists  a  signed  measure  /x  on  [0 ,T\  such  that 
a(t)  =  /x([0,  t])  for  every  t  G  [0,  T]. 

The  measure  /x  is  then  determined  uniquely  by  a.  Since  a  is  continuous  and 
a(0)  =  0,  it  follows  that  /x  has  no  atoms. 

Remark  The  general  definition  of  a  function  with  finite  variation  does  not  require 
continuity  nor  the  condition  a(0)  =  0.  We  impose  these  two  conditions  for 
convenience. 

The  decomposition  of  /x  as  a  difference  of  two  finite  positive  measures  on  [0,  T] 
is  not  unique,  but  there  exists  a  unique  decomposition  /x  =  /x+  —  /x_  such  that 
/x+  and  /x_  are  supported  on  disjoint  Borel  sets.  To  get  the  existence  of  such  a 
decomposition,  start  from  an  arbitrary  decomposition /x  =  i±\—  /X2,  set  v  =  /X1+/X2 
and  then  use  the  Radon-Nikodym  theorem  to  find  two  nonnegative  Borel  functions 
h\  and  on  [0,  T]  such  that 

/xi(dt)  =  /X2(d  t)  —  Ii2(t)v(dt). 

Then,  if  h(t)  —  h\(t)  —  /^(fh  we  have 

/x(dt)  =  h(t)v(dt)  —  h(t)+v(dt)  —  h(t)~v(dt), 

which  gives  the  decomposition /x  =  /x+ — /x_  with /x+(dt)  =  h(t)+v(dt),  /x_(dt)  = 
h(t)~v(dt),  and  the  measures  /x+  and  /x_  are  supported  respectively  on  the  disjoint 
Borel  sets  D+  —  {t  :  h(t)  >  0}  and  D-  —  {t  :  h(t)  <  0}.  The  uniqueness  of  this 
decomposition  /x  =  /x+  —  /x_  follows  from  the  fact  that  we  have  necessarily,  for 
every  A  e  ^([0,  T]), 


/x+(A)  =  sup {/x(C)  :  C  G  ^([0,  T]),  C  c  A}. 


We  write  |/x|  for  the  (finite)  positive  measure  |/x|  =  /x+  +  /x_.  The  measure  |/x| 
is  called  the  total  variation  of  a.  We  have  |/x(A)|  <  |/x|  (A)  for  every  A  G  ^([0,  T]). 
Moreover,  the  Radon-Nikodym  derivative  of  /x  with  respect  to  |/x|  is 


d/x 

d|/x 


The  fact  that  a(t)  =  /x+([0,t])  —  /x_([0,t])  shows  that  a  is  the  difference 
of  two  monotone  nondecreasing  continuous  functions  that  vanish  at  0  (since  /x 
has  no  atoms,  the  same  holds  for  /x+  of  /x_).  Conversely,  the  difference  of  two 
monotone  nondecreasing  continuous  functions  that  vanish  at  0  has  finite  variation 
in  the  sense  of  the  previous  definition.  Indeed,  this  follows  from  the  well-known  fact 
that  the  formula  g(t)  =  0([0,t]),  t  g  [0,  T]  induces  a  bijection  between  monotone 
nondecreasing  right-continuous  functions  g  :  [0,  T]  — >  M+  and  finite  positive 
measures  0  on  [0,  T]. 
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Let /  :  [0,  T]  — >  R  be  a  measurable  function  such  that  hQ  T *  |/(^)|  |/x|(ds)  <  oo. 
We  set 


L 


f(s)  da(s) 


L 


f(s)  |da(^) 


=  1 
-l 


f(s)p(ds), 
f(s )  |pt|(d^). 


Then  the  bound 


L 


f(s)  da(s) 


< 


l 


/(s)|  \da(s) 


holds.  By  restricting  a  to  [0,  t]  (which  amounts  to  restricting  \i,  /x+,  /x_),  we 
can  define  J^fis)  da(s)  for  every  t  e  [0,  7],  and  we  observe  that  the  function 

t  i— >  f*f(s)  d  a(s)  also  has  finite  variation  on  [0,  T]  (the  associated  measure  is  just 
fi'(ds)  —  f(s)p(ds)). 

Proposition  4.2  For  every  t  e  (0,7], 


(  P  ) 

|da(s)|  =  sup  <  ^2  I a(h)  ~  a(fi- i)l|  , 

(  i=  1  ) 

where  the  sup  remum  is  over  all  subdivisions  0  =  to  <  t\  <  •  •  •  <  tp  =  t  of  [0,  t\. 
More  precisely,  for  any  increasing  sequence  0  =  ^  <  f[  <  •  •  •  <  ^  —  t  of 
subdivisions  of  [ft,  t\  whose  mesh  tends  to  0,  we  have 

Pn 

lim  £W) -«(*->)!  = 

n->o o  z ' 
i=  1 

Remark  In  the  usual  presentation  of  functions  with  finite  variation,  one  starts  from 
the  property  that  the  supremum  in  the  first  display  of  the  proposition  is  finite. 

Proof  Clearly,  it  is  enough  to  treat  the  case  t  —  T.  The  inequality  >  in  the  first 
assertion  is  very  easy  since,  for  any  subdivision  0  =  to  <  t\  <  •  •  •  <  tp  =  T  of 
[0,  T], 


aiti)  -  a(ti- 1)|  =  <  |/a|(0;_i,  u}),  Vi  e  {1, . . .  ,p}, 


p  f 

^M(0;-idi])  =  N([M)  =  / 

i=  1 


da  (.s1) 


and 
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In  order  to  get  the  reverse  inequality,  it  suffices  to  prove  the  second  assertion.  So 
we  consider  an  increasing  sequence  0  =  <  f[  <  •  •  •  <  f  —  T  of  subdivisions 

of  [0,  T],  whose  mesh  max{^7  —  :  1  <  i  <  pn}  tends  to  0.  Although  we  are 

proving  a  “deterministic”  result,  we  will  use  a  martingale  argument.  Leaving  aside 
the  trivial  case  where  |/x|  =  0,  we  introduce  the  probability  space  Q  —  [0,  T\,  which 
is  equipped  with  the  Borel  a -field  38  —  3?([0 ,T])  and  the  probability  measure 
P(ds)  =  (|/z|([0,  T]))_1 1 /x |  (dv) .  On  this  probability  space,  we  consider  the  discrete 
filtration  (38n)n> o  such  that,  for  every  integer  n  >  0,  38n  is  the  a-field  generated  by 
the  intervals  ,  tj7],  1  <  i  <  pn.  We  then  set 


X(^)  =  1  d+0)  -  1  d-(s) 


and,  for  every  n  >  0, 


=  E[X  |  @n\. 

Properties  of  conditional  expectation  show  that  Xn  is  constant  on  every  interval 
,  t"]  and  takes  the  value 

Kifi-i’1"])  =  a(fi)  -  a(fi- 1) 

ImI  ImIW-p^]) 

on  this  interval.  On  the  other  hand,  the  sequence  (Xn)  is  a  closed  martingale,  with 
respect  to  the  discrete  filtration  (38n).  Since  X  is  measurable  with  respect  to  38  — 
3%n,  this  martingale  converges  to  X  in  L1,  by  the  convergence  theorem  for  closed 
discrete  martingales  (see  Appendix  A2).  In  particular, 

lim  E[\Xn\\  =E[\X\]  =  1, 

n^oo 


where  the  last  equality  is  clear  since  \X(js)\  —  1,  | /x  |  (dv)  a.e.  The  desired  result 
follows  by  noting  that 


Pn 

E[\xn\ ]  =  (N([o,n)r1EK<?)- 0(^)1, 

i=  1 


and  recalling  that  |/x|([0,  T])  =  |da(5)|.  □ 

We  now  give  a  useful  approximation  lemma  for  the  integral  of  a  continuous 
function  with  respect  to  a  function  with  finite  variation. 
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Lemma  4.3  Iff  :  [0,  T]  — >  R  is  a  continuous  function,  and  if0  =  f^<f[<---< 
tn  —  T  is  a  sequence  of  subdivisions  of[0,  T]  whose  mesh  tends  to  0,  we  have 


t 


f(s)  da  00 


Pn 

lim 

n—>o o  ‘  J 
i=  1 


Proof  Let  fn  be  defined  on  [0,  T ]  by/„(s)  =  )  if  s  €  (f‘_t ,  ff\,  1  <  i  <  pn,  and 

fn(0)  =/(0).  Then, 


Pn  /> 

£/«! 1)  (a(<?)  -  a^-,))  =  /  ,/„(.s')  M(dS), 

t=i 


and  the  desired  result  follows  by  dominated  convergence  since  fn  (s)  — >  f(s)  as 
n  — >  oo,  for  every  s  G  [0,  T7].  □ 

We  say  that  a  function  a  :  M+  — >  R  is  a  finite  variation  function  on  M+  if  the 
restriction  of  a  to  [0,  T]  has  finite  variation  on  [0,  T],  for  every  T  >  0.  Then  there  is 
a  unique  a -finite  (positive)  measure  on  M+  whose  restriction  to  every  interval  [0,  T\ 
is  the  total  variation  measure  of  the  restriction  of  a  to  [0,  T],  and  we  write 


f 


oo 


f(s)  |da(s) 


for  the  integral  of  a  nonnegative  Borel  function/  on  R+  with  respect  to  this  a -finite 
measure.  Furthermore,  we  can  define 


poo 

Jo 


f(s)Aa(s) 


—  lim  / 

T->ooJ0 


f(s)da(s)  e  (—oo,  oo) 


for  any  real  Borel  function/  on  M+  such  that  /0°°  \f(s)\  | d<^ (^) |  <  oo 


4.1.2  Finite  Variation  Processes 

We  now  consider  random  variables  and  processes  defined  on  a  filtered  probability 
space  (J/),F). 

Definition  4.4  An  adapted  process  A  —  (Ar)r>o  is  called  a  finite  variation  process 
if  all  its  sample  paths  are  finite  variation  functions  on  M+ .  If  in  addition  the  sample 
paths  are  nondecreasing  functions,  the  process  A  is  called  an  increasing  process. 

Remark  In  particular,  Ao  =  0  and  the  sample  paths  of  A  are  continuous  -  one 
can  define  finite  variation  processes  with  cadlag  sample  paths,  but  in  this  book 
we  consider  only  the  case  of  continuous  sample  paths.  Our  special  convention 
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that  the  initial  value  of  a  finite  variation  process  is  0  will  be  convenient  for  certain 
uniqueness  statements. 

If  A  is  a  finite  variation  process,  the  process 

Vt=  [  \dAs 
Jo 

is  an  increasing  process.  Indeed,  it  is  clear  that  the  sample  paths  of  V  are 
nondecreasing  functions  (as  well  as  continuous  functions  that  vanish  at  t  —  0).  The 
fact  that  Vt  is  an  ^-measurable  random  variable  can  be  deduced  from  the  second 
part  of  Proposition  4.2.  Writing  At  —  \{Vt  +  At)  —  \{Vt—  At)  shows  that  any  finite 
variation  process  can  be  written  as  the  difference  of  two  increasing  processes  (the 
converse  is  obvious). 

Proposition  4.5  Let  A  be  a  finite  variation  process,  and  let  H  be  a  progressive 
process  such  that 


Vt  >  0,  Vco  e  £2 , 


l 


Hs(a>)  |  |dAj(ft»)| 


<  00. 


Then  the  process  H  •  A  —  (( H  •  A)r)r>o  defined  by 


Hs  dAs 


is  also  a  finite  variation  process. 


Proof  By  the  observations  preceding  the  statement  of  Proposition  4.2,  we  know 
that  the  sample  paths  of  H  •  A  are  finite  variation  functions.  It  remains  to  verify 
that  the  process  H  •  A  is  adapted.  To  this  end,  it  is  enough  to  check  that,  if  t  >  0 
is  fixed,  if  h  :  £2  x  [0,  t\  — >  R  is  measurable  for  the  a-field  0  ^([0,  t]),  and 
if  ft,  \h(co , s)\\dAs(co)\  <  oo  for  every  co,  then  the  variable  /Qr  h(co ,  s)dAs(co)  is 
measurable. 

If  h(co,s)  =  1  (UtV](s)lr(o))  with  (u,  v]  c  [0,t]  and  T  e  the  result 
is  immediate  since  h(co,  s)dAs(co)  =  lr(co)  (Av(co)  —  Au(co))  in  that  case.  A 
monotone  class  argument  (see  Appendix  Al)  then  gives  the  case  h  —  1  g,  G  e 
fifit  0  mo,  t]).  Finally,  in  the  general  case,  we  observe  that  we  can  write  h  as  a 
pointwise  limit  of  a  sequence  of  simple  functions  (i.e.  finite  linear  combinations  of 
indicator  functions  of  measurable  sets)  hn  such  that  \hn\  <  \h\  for  every  n ,  and  that 
we  then  have  hn(co ,  s)6As(co)  — >  h(co ,  s)dAs(co)  by  dominated  convergence, 
for  every  co  e  Q .  □ 
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Remarks 

(i)  It  happens  frequently  that  instead  of  the  assumption  of  the  proposition  we  have 
the  weaker  assumption 


a.s.  Vf  >  0 


.  / 


Hs(a>) |  |dL4s(w)|  <  oo. 


If  the  filtration  is  complete,  we  can  still  define  H  •  A  as  a  finite  variation  process 
under  this  weaker  assumption.  We  replace  H  by  the  process  H'  defined  by 


H,(w)  if  /"  |flj(<y)|  |cMs(ft>)|  <oo,  Vn  , 
0  otherwise. 


Thanks  to  the  fact  that  the  filtration  is  complete,  the  process  H'  is  still 
progressive,  which  allows  us  to  define  H -A  =  H'  -A.  We  will  use  this  extension 
of  Proposition  4.5  implicitly  in  what  follows. 

(ii)  Under  appropriate  assumptions  (if  H  and  K  are  progressive  and  \HS  \  \6AS\  < 
oo,  /q  \HsKs  \  | dAs |  <  oo  for  every  t  >  0),  we  have  the  “associativity”  property 

K  •  (H  -A)  =  C KH )  -  A.  (4.1) 


This  indeed  follows  from  the  analogous  deterministic  result  saying  informally 
thatk(s)  ( h(s )  =  (k(s)h(s))  fi(ds)  if  h(s )  and  k(s)h(s )  are  integrable  with 

respect  to  the  signed  measure  /x  on  [0,  t\. 

An  important  special  case  of  the  proposition  is  the  case  where  At  —  t.  If  H  is  a 
progressive  process  such  that 


Wt  ^  0,  V co  g  i T2, 


ds  <  oo, 


the  process  JqHs  d^  is  a  finite  variation  process. 


4.2  Continuous  Local  Martingales 

We  consider  again  a  filtered  probability  space  (£2,^,  (^t) ,  P) .  If  T  is  a  stopping 
time,  and  if  X  =  (Xt)t>o  is  an  adapted  process  with  continuous  sample  paths,  we 
will  write  XT  for  process  X  stopped  at  T,  defined  by  Xj  =  XtAT  for  every  t  >  0.  It  is 
useful  to  observe  that,  if  S  is  another  stopping  time, 


( XT)S  =  ( XS)T  =  Xs  AT. 
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Definition  4.6  An  adapted  process  M  —  (Mt)t> o  with  continuous  sample  paths 
and  such  that  Mq  =  0  a.s.  is  called  a  continuous  local  martingale  if  there  exists  a 
nondecreasing  sequence  (Tn)n>o  of  stopping  times  such  that  Tn  \  oo  (i.e.  Tn{co)  \  oo 
for  every  co)  and,  for  every  n ,  the  stopped  process  MTn  is  a  uniformly  integrable 
martingale. 

More  generally,  when  we  do  not  assume  that  M0  =  0  a.s.,  we  say  that  M  is 
a  continuous  local  martingale  if  the  process  Nt  —  Mt  —  Mo  is  a  continuous  local 
martingale. 

In  all  cases,  we  say  that  the  sequence  of  stopping  times  ( Tn )  reduces  M  if  Tn  \  oo 
and,  for  every  n ,  the  stopped  process  MTn  is  a  uniformly  integrable  martingale. 

Remarks 

(i)  We  do  not  require  in  the  definition  of  a  continuous  local  martingale  that 
the  variables  Mt  are  in  L1  (compare  with  the  definition  of  martingales).  In 
particular,  the  variable  Mo  may  be  any  -measurable  random  variable. 

(ii)  Any  martingale  with  continuous  sample  paths  is  a  continuous  local  martingale 
(see  property  (a)  below)  but  the  converse  is  false,  and  for  this  reason  we  will 
sometimes  speak  of  “true  martingales”  to  emphasize  the  difference  with  local 
martingales.  Let  us  give  a  few  examples  of  continuous  local  martingales  which 
are  not  (true)  martingales.  If  B  is  an  (J^)-Brownian  motion  started  from  0, 
and  Z  is  an  -measurable  random  variable,  the  process  Mt  =  Z  +  Bt  is 
always  a  continuous  local  martingale,  but  is  not  a  martingale  if  E[\Z\]  —  oo. 
If  we  require  the  property  Mo  —  0,  we  can  also  consider  Mt  —  ZBt ,  which  is 
always  a  continuous  local  martingale  (see  Exercise  4.22)  but  is  not  a  martingale 
if  E[\Z\]  =  oo.  For  a  less  artificial  example,  we  refer  to  question  (8)  of 
Exercise  5.33. 

(iii)  One  can  define  a  notion  of  local  martingale  with  cadlag  sample  paths.  In  this 
course,  however,  we  consider  only  continuous  local  martingales. 

The  following  properties  are  easily  established. 

Properties  of  continuous  local  martingales. 

(a)  A  martingale  with  continuous  sample  paths  is  a  continuous  local  martingale, 
and  the  sequence  Tn  —  n  reduces  M. 

(b)  In  the  definition  of  a  continuous  local  martingale  starting  from  0,  one  can 
replace  “uniformly  integrable  martingale”  by  “martingale”  (indeed,  one  can 
then  observe  that  MT,lAn  is  uniformly  integrable,  and  we  still  have  Tn  A  n  t  oo). 

(c)  If  M  is  a  continuous  local  martingale,  then,  for  every  stopping  time  T,  MT  is  a 
continuous  local  martingale  (this  follows  from  Corollary  3.24). 

(d)  If  ( Tn )  reduces  M  and  if  (Sn)  is  a  sequence  of  stopping  times  such  that  Sn  t 
then  the  sequence  ( Tn  A  Sn)  also  reduces  M  (use  Corollary  3.24  again). 
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(e)  The  space  of  all  continuous  local  martingales  is  a  vector  space  (to  check  stability 
under  addition,  note  that  if  M  and  M'  are  two  continuous  local  martingales  such 
that  Mo  —  0  and  M'0  —  0,  if  the  sequence  ( Tn )  reduces  M  and  if  the  sequence 
(T'n)  reduces  M',  property  (d)  shows  that  the  sequence  Tn  A  T'n  reduces  M  +  M'). 

The  next  proposition  gives  three  other  useful  properties  of  local  martingales. 

Proposition  4.7 

(i)  A  nonnegative  continuous  local  martingale  M  such  that  Mo  £  Ll  is  a 
supermartingale. 

(ii)  A  continuous  local  martingale  M  such  that  there  exists  a  random  variable  Z  £ 
Ll  with  \Mt\  <  Z  for  every  t  >  0  (in  particular  a  bounded  continuous  local 
martingale )  is  a  uniformly  integrable  martingale. 

(iii)  If  M  is  a  continuous  local  martingale  and  Mo  =  0  (or  more  generally  Mo  £ 
Ll),  the  sequence  of  stopping  times 

Tn  —  inf{t  >  0  :  \Mt\  >  n) 


reduces  M. 

Proof 

(i)  Write  Mt  =  Mo  +  Nt.  By  definition,  there  exists  a  sequence  (Tn)  of  stopping 
times  that  reduces  N.  Then,  if  s  <  t,  we  have  for  every  n , 


NsATn  —  E[Nt  ATn 


We  can  add  on  both  sides  the  random  variable  M0  (which  is  ZZo -measurable 
and  in  L1  by  assumption),  and  we  get 


M,sATn  —  E[Mt  ATn 


Since  M  takes  nonnegative  values,  we  can  now  let  n  tend  to  oo  and  apply  the 
version  of  Fatou’s  lemma  for  conditional  expectations,  which  gives 


Ms  >  E[Mt  |  &s\. 


Taking  s  —  0,  we  get  E[Mt]  <  E[Mf\  <  oo,  hence  Mt  £  Ll  for  every  t  >  0. 
The  previous  inequality  now  shows  that  M  is  a  supermartingale. 

(ii)  By  the  same  argument  as  in  (i),  we  get  for  0  <  s  <  t. 


MSATn  —  E[Mf ATn  I  ^s\ 


(4.2) 


Since  | MtATn  <  Z,  we  can  use  dominated  convergence  to  obtain  that  the 
sequence  MtATn  converges  to  Mt  in  Ll.  We  can  thus  pass  to  the  limit  n  -> 
in  (4.2),  and  get  that  Ms  —  E[Mt  \  &s]. 


oo 
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(iii)  Suppose  that  Mo  =  0.  The  random  times  Tn  are  stopping  times  by  Proposi¬ 
tion  3.9.  The  desired  result  is  an  immediate  consequence  of  (ii)  since  MTn  is  a 
continuous  local  martingale  and  \MTn  \  <  n.  If  we  only  assume  that  Mo  e  L1 , 
we  observe  that  MTn  is  dominated  by  n  +  |Mo  | .  □ 

Remark  Considering  property  (ii)  of  the  proposition,  one  might  expect  that  a 
continuous  local  martingale  M  such  that  the  collection  (M,)r>0  is  uniformly 
integrable  (or  even  a  continuous  local  martingale  satisfying  the  stronger  property 
of  being  bounded  in  LP  for  some  p  >  1)  is  automatically  a  martingale.  This  is 
incorrect!!  For  instance,  if  B  is  a  three-dimensional  Brownian  motion  started  from 
i  /  0,  the  process  Mt  —  l/\Bt\  is  a  continuous  local  martingale  bounded  in  L2,  but 
is  not  a  martingale:  see  Exercise  5.33. 

Theorem  4.8  Let  M  be  a  continuous  local  martingale.  Assume  that  M  is  also  a 
finite  variation  process  (in  particular  Mo  =  0).  Then  Mt  —  d  for  every  t  >  0,  a.s. 

Proof  Set 


zn  —  inf {t  >  0  :  f  \dMs\  >  n) 

Jo 

for  every  integer  n  >  0.  By  Proposition  3.9,  rn  is  a  stopping  time  (recall  that  /Qr  |d Ms 
is  an  increasing  process  if  M  is  a  finite  variation  process). 

Fix  n  >  0  and  set  N  —  MTn .  Note  that,  for  every  t  >  0, 


IVI  =  I  MtAZn 


<  fl. 


By  Proposition  4.7,  Af  is  a  (bounded)  martingale.  Let  t  >  0  and  let  0  =  to  <  t\  < 
^  tp  —  t  be  any  subdivision  of  [0,  t\.  Then,  from  Proposition  3.14,  we  have 


E[Nf]  =  y>[(JV,  -  v,_,)2] 

i=  1 


<  E 


<  nE 


sup  | Nti-Nti_t 

1  <i<p 


noting  that  \ dNs  \  <  n  by  the  definition  of  rn,  and  using  Proposition  4.2. 

We  now  apply  the  preceding  bound  to  a  sequence  0  =  <  t\  <  •  •  •  <  4  =  rof 

subdivisions  of  [0,  t]  whose  mesh  tends  to  0.  Using  the  continuity  of  sample  paths, 
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and  the  fact  that  N  is  bounded  (to  justify  dominated  convergence),  we  get 


lim  E  sup  \Ntk  —  Njk 

fc— >-00  L  1  <i<p/<  i  i  1 


=  0. 


We  then  conclude  that  E[Nf]  =  0,  hence  MtATn  —  0  a.s.  Letting  n  tend  to  00,  we  get 
thatMr  =  0a.s.  □ 


4.3  The  Quadratic  Variation  of  a  Continuous  Local 
Martingale 

From  now  on  until  the  end  of  this  chapter  (and  in  the  next  chapter),  we  assume  that 
the  filtration  (J^)  is  complete.  The  next  theorem  will  play  a  very  important  role  in 
forthcoming  developments. 

Theorem  4.9  Let  M  —  {Mt)t> 0  be  a  continuous  local  martingale.  There  exists  an 
increasing  process  denoted  by  ((M,M)t)t> 0,  which  is  unique  up  to  indistinguisha- 
bility,  such  that  My  —  ( M,M)t  is  a  continuous  local  martingale.  Furthermore,  for 
every  fixed  t  >  0,  if  0  =  <  t}{  <  •  •  •  <  tpn  —  t  is  an  increasing  sequence  of 

subdivisions  of[ 0,  t]  with  mesh  tending  to  0,  we  have 


Pn 

lim  (4.3) 

n—>oo  ‘  J  1  1  1 

i=  1 

in  probability.  The  process  ( M ,  M)  is  called  the  quadratic  variation  ofM. 

Let  us  immediately  mention  an  important  special  case.  If  M  —  B  is  an  (J^)- 
Brownian  motion  (see  Definition  3.11)  then  B  is  a  martingale  with  continuous 
sample  paths,  hence  a  continuous  local  martingale.  Then  by  comparing  (4.3)  with 
Proposition  2.16,  we  get  that,  for  every  t  >  0, 


(. B,B),  =  t. 


So  the  quadratic  variation  of  a  Brownian  motion  is  the  simplest  increasing  process 

one  can  imagine. 

Remarks 

(i)  We  observe  that  the  process  (M,M)  does  not  depend  on  the  initial  value  Mo, 
but  only  on  the  increments  of  M:  if  Mt  —  Mo  +  Nt,  we  have  (M,  M)  =  (N,  N). 
This  is  obvious  from  the  second  assertion  of  the  theorem,  and  this  will  also  be 
clear  from  the  proof  that  follows. 

(ii)  In  the  second  assertion  of  the  theorem,  it  is  in  fact  not  necessary  to  assume  that 
the  sequence  of  subdivisions  is  increasing. 
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Proof  We  start  by  proving  the  first  assertion.  Uniqueness  is  an  easy  consequence 
of  Theorem  4.8.  Indeed,  let  A  and  A'  be  two  increasing  processes  satisfying  the 
condition  given  in  the  statement.  Then  the  process  At  —A't  —  (Mj  —A't)  —  ( M ]  —At) 
is  both  a  continuous  local  martingale  and  a  finite  variation  process.  It  follows  that 
A -A7  =  0. 

In  order  to  prove  existence,  consider  first  the  case  where  M0  =  0  and  M  is 
bounded  (hence  M  is  a  true  martingale,  by  Proposition  4.7  (ii)).  Fix  K  >  0  and  an 
increasing  sequence  0  =  ft  <  f[  <  •  •  •  <  f  —  K  of  subdivisions  of  [0,  K\  with 
mesh  tending  to  0. 

We  observe  that,  for  every  0  <  r  <  s  and  for  every  bounded  ^-measurable 
variable  Z,  the  process 


Nt  —  Z(MsAt  —  MrAt) 

is  a  martingale  (the  reader  is  invited  to  write  down  the  easy  proof!).  It  follows  that, 
for  every  n ,  the  process 


Pn 

i=  1 

is  a  (bounded)  martingale.  The  reason  for  considering  these  martingales  comes  from 
the  following  identity,  which  results  from  a  simple  calculation:  for  every  n ,  for  every 
j  €  {0,  1,  .  .  .  ,pn}. 


n 


Mi  -  2 Xtn 

7  7 


(4.4) 


i=  1 


Lemma  4.10  We  have 


lim  E[(XnK  -  X %)2]  =  0. 

n,m— >oo 

Proof  of  the  lemma  Let  us  fix  n  <  m  and  evaluate  the  product  E[XnKX’^].  This 
product  is  equal  to 


Pn  Pm 

y:  y:  E[Mt»_]  (m^  m^_i  )  m^i  (m^  m^_x  )] . 

i=  1  j=  1 

In  this  double  sum,  the  only  terms  that  may  be  nonzero  are  those  corresponding 
to  indices  i  and  j  such  that  the  interval  (t”X  ,  f1  ]  is  contained  in  (E, ,  f  1.  Indeed, 
suppose  that  f  <  (the  symmetric  case  f1  <  is  treated  in  an  analogous  way). 


4.3  The  Quadratic  Variation  of  a  Continuous  Local  Martingale 


81 


Then,  conditioning  on  the  a -field  ,  we  have 


E[Mf}  |  (Mtn  —  My  | )  Mtm  |  (Mtm  —  Mrm  | )] 

=  E[Mtn_^  (Mrn  —  )  Mfv_ !  E\Mtm  —  Mtm_^ 


For  every  j  —  1, . . .  ,pm,  write  in,mU )  for  the  unique  index  i  such  that  (z'E, ,  tj1]  C 
(^7_! ,  It  follows  from  the  previous  considerations  that 


£[«] 


E 

1  iSj  —Pm  >  i — in,m  (j) 


M?. i,)]. 


In  each  term  E[Mr»_i  (Mt>>  —  M^_,)  (M/«  —  M^)],  we  can  now  decompose 


Mf  —  \  —  Mfn_  ) 

{  z  1  k  k  1 


k-hi,m  (k) — i 


and  we  observe  that,  if  k  is  such  that  zw,m  (&)  =  i  but  k  ^  j, 

( Mfn  -  Mpn_x)]  =  0 

(condition  on  if  k  >  j  and  on  if  k  <  j).  The  only  case  that  remains  is 
k  —  j,  and  we  have  thus  obtained 

E[X'kX'k]  =  J2  W»_,  )2] . 

1  ^J—Pnit  i=in,mU) 


As  a  special  case  of  this  relation,  we  have 


E[(X'k)2}  =  E  - M^)2]. 

•  </</?/„ 


Furthermore, 


«)2] 


\<i<Pn 

=  E  ^^^[(M^-M^)2!^]] 

=  E  £ki.  E  ^[(%-m^)2  i^_j 


!</</?/ 


j  •  hi,  m  (j)  i 


1  —Pm  >  i — hi,m  (j) 


where  we  have  used  Proposition  3.14  in  the  third  equality. 
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If  we  combine  the  last  three  displays,  we  get 


E[(X’‘  -  X”)2]  =  E 


1  —Pm 5  i —  hijri (j) 


Using  the  Cauchy-Schwarz  inequality,  we  then  have 


E[(XnK  -  X%)2]  <  E 


sup  (M”_,  -MijlJ 

1  —I  —Pm  i  i  hi.m  (j) 


1 1/2 


x£[(  E  (M^-M^)2) 


2-i  1/2 


•  <i<p 


m 


By  the  continuity  of  sample  paths  (together  with  the  fact  that  the  mesh  of  our 
subdivisions  tends  to  0)  and  dominated  convergence,  we  have 


lim  E 


n,m^oo,n<m  L  i=inm(j) 


sup 


=  0. 


To  complete  the  proof  of  the  lemma,  it  is  then  enough  to  prove  the  existence  of  a 
finite  constant  C  such  that,  for  every  m, 


_  ^ 
q(  E  (M<?  -  >/) 


<  c. 


(4.5) 


•  <i<p 


m 


Let  A  be  a  constant  such  that  \Mt\  <  A  for  every  t  >  0.  Expanding  the  square  and 
using  Proposition  3.14  twice,  we  have 


2  " 

E  (  E  (Mtf  - 


i  </<Pi 


777 


=  ^  '  (Mfin  —  Mfn  |  ^  '  (Mfii  —  Mjjn  ^  {Mfn  —  Mt™  |  )2 


1  </'<P. 


777 


•  <i<k<p 


m 


<  4A2E[  E 

1  </<Pm 
Pm  1 


Pm  A  Pm 

+2  ^  —  M^)2^  ^  (M^  —Mfn_y 


^f} 


j=  i  ^=7+ 1 

4A2e|^  ^  {Mfn  —  Mfn_x)2 
1  </<Pm 


Pm  1 

+2  E  -M;»)2  I 


L  J 
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<12 A2e\  (M,j  -  M,»L]  )2 

1  <j<Pm 

=  12 A2  E[(MK  -  M0)2] 

<  48 A4 


which  gives  the  bound  (4.5)  with  C  =  48A4.  This  completes  the  proof.  □ 

We  now  return  to  the  proof  of  the  theorem.  Thanks  to  Doob’s  inequality  in  L2 
(Proposition  3.15  (ii)),  and  to  Lemma  4.10,  we  have 


lim 

n,m—>oo 


sup(JV 

L  t<K 


(4.6) 


In  particular,  for  every  t  G  [0,/T],  (Xf)n>\  is  a  Cauchy  sequence  in  L 2  and  thus 
converges  in  L2.  We  want  to  argue  that  the  limit  yields  a  process  Y  indexed  by  [0,  K\ 
with  continuous  sample  paths.  To  see  this,  we  note  that  (4.6)  allows  us  find  a  strictly 
increasing  sequence  (rik)k>  t  of  positive  integers  such  that,  for  every  k  >  1, 


suppQ 
L  t<K 


Kk+ 1 


This  implies  that 


<  oo 


and  thus 


oo 


E 


k=  1  - 


sup  \X, 

t<K 


Kk+ 1 


-  Xr!k  <  oo 


a.s. 


Consequently,  except  on  the  negligible  set  c/C  where  the  series  in  the  last  display 
diverges,  the  sequence  of  random  functions  (X^ ,  0  <  t  <  K)  converges  uniformly 
on  [0,  K]  as  k  ->  oo,  and  the  limiting  random  function  is  continuous  by  uniform 
convergence.  We  can  thus  set  Yt{co)  —  lim Xlt2k(co),  for  every  t  g  [0,K],  if  co  e 
Q\JY ,  and  Yt(co)  =  0,  for  every  t  e  [0,  A],  if  co  e  JY .  The  process  (Yt)o<t<x 
has  continuous  sample  paths  and  Yt  is  ^-measurable  for  every  t  G  [0,  K\  (here  we 
use  the  fact  that  the  filtration  is  complete,  which  ensures  that  JY  G  YPt  for  every 
t  >  0).  Furthermore,  since  the  L2-limit  of  ( Xf^)n>\  must  coincide  with  the  a.s.  limit 
of  a  subsequence,  Yt  is  also  the  limit  of  Xnt  in  L2,  for  every  t  G  [0,  K\,  and  we  can 
pass  to  the  limit  in  the  martingale  property  for  Xn ,  to  obtain  that  E[Yt  \  YPS]  —  Ys 
for  every  0  <  s  <  t  <  K.  It  follows  that  ( YtAK)t>o  is  a  martingale  with  continuous 
sample  paths. 
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On  the  other  hand,  the  identity  (4.4)  shows  that  the  sample  paths  of  the  process 
M2  —2X"  are  nondecreasing  along  the  finite  sequence  (fj ,  0  <  i  <  pn).  By  passing  to 
the  limit  n  ->  oo  along  the  sequence  (itk)k>  t,  we  get  that  the  sample  paths  of  M2  — 
2  Yt  are  nondecreasing  on  [0,  K\,  except  maybe  on  the  negligible  set  JV .  For  every 
t  g  [0,  K\,  we  set  a\K)  —  M 2  —  2 Yt  on  Q\JA ,  and  a\K)  =  0  on  JY .  Then  A^)  = 
0,  A is  ^-measurable  for  every  t  G  [0,/sf],  A^K)  has  nondecreasing  continuous 
sample  paths,  and  (M2aA  —  AjAA)r>o  is  a  martingale. 

We  apply  the  preceding  considerations  with  K  —  f,  for  every  integer  l  >  1, 

and  we  get  a  process  (A;  )o<f<^.  We  then  observe  that,  for  every  l  >  1,  A/a£  = 

m 

ArA^  for  every  t  >  0,  a.s.,  by  the  uniqueness  argument  explained  at  the  beginning 
of  the  proof.  It  follows  that  we  can  define  an  increasing  process  (M,  M)  such  that 
(M,  M)t  =  Ap  for  every  t  G  [0,  l\  and  every  l  >  1,  a.s.,  and  clearly  M2  —  (M,  M)t 
is  a  martingale. 

In  order  to  get  (4.3),  we  observe  that,  if  K  >  0  and  the  sequence  of  subdivisions 
0  =  ^  <  t\  <  •  •  •  <  tpn  —  K  are  fixed  as  in  the  beginning  of  the  proof,  the 

process  AtAK  must  be  indistinguishable  from  (M,M)tAK,  again  by  the  uniqueness 
argument  (we  know  that  both  M2aA  —  a|aA  and  M2aA  —  (M,  M)tAK  are  martingales). 
In  particular,  we  have  ( M,M)K  —  AK  a.s.  Then,  from  (4.4)  with  j  —  pn ,  and  the 
fact  that  XA  converges  in  L 2  to  YK  =  \{M2K  —  A we  get  that 


Pn 


lim  Y' (M,n  -  Mff  )2  =  (M,  M) 

n— >oo  J 


K 


7=1 


in  L 2 .  This  completes  the  proof  of  the  theorem  in  the  case  when  Mo  =  0  and  M  is 
bounded. 

Let  us  consider  the  general  case.  Writing  Mt  —  Mo  +  Nt,  so  that  M2  = 
Mq  +  2MoNt  +  N2,  and  noting  that  M$Nt  is  a  continuous  local  martingale  (see 
Exercise  4.22),  we  see  that  we  may  assume  that  M0  =  0.  We  then  set 


Tn  —  inf{t  >  0  :  \Mt\  > 

and  we  can  apply  the  bounded  case  to  the  stopped  martingales  MTn.  Set  A^  = 
(. MTn ,  Mr,? ) .  The  uniqueness  part  of  the  theorem  shows  that  the  processes  a|aA^  and 

a|/?]  are  indistinguishable.  It  follows  that  there  exists  an  increasing  process  A  such 
that,  for  every  n ,  the  processes  A^^  and  AJ  are  indistinguishable.  By  construction, 
M2aA  —  Ajat^  is  a  martingale  for  every  n ,  which  precisely  implies  that  M2  —  Af  is  a 
continuous  local  martingale.  We  take  (M,M)t  =  At,  which  completes  the  proof  of 
the  existence  part  of  the  theorem. 

Finally,  to  get  (4.3),  it  suffices  to  consider  the  case  Mo  =  0.  The  bounded  case 
then  shows  that  (4.3)  holds  if  M  and  (M,  M)t  are  replaced  respectively  by  MTn  and 
(M,M)tATn  (even  with  convergence  in  L2).  Then  it  is  enough  to  observe  that,  for 
every  t  >  0,  P(t  <  Tn )  converges  to  1  when  n  — >  oo.  □ 
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Proposition  4.11  Let  M  be  a  continuous  local  martingale  and  let  T  be  a  stopping 
time.  Then  we  have  a.s.  for  every  t  >  0, 

(MT ,  Mt),  =  (M,M),at. 

This  follows  from  the  fact  that  M2aT  —  (M,  M)tAT  is  a  continuous  local  martingale 
(cf.  property  (c)  of  continuous  local  martingales). 

Proposition  4.12  Let  M  be  a  continuous  local  martingale  such  that  Mo  =  0.  Then 
we  have  ( M ,  M)  —  0  if  and  only  if  M  —  0. 

Proof  Suppose  that  (M,  M)  =  0.  Then  M2  is  a  nonnegative  continuous  local 
martingale  and,  by  Proposition  4.7  (i),  M2  is  a  supermartingale,  hence  E[M2]  < 
E[M q]  =  0,  so  that  Mt  —  0  for  every  t.  The  converse  is  obvious.  □ 

The  next  theorem  shows  that  properties  of  a  continuous  local  martingale  are 
closely  related  to  those  of  its  quadratic  variation.  If  A  is  an  increasing  process,  A0 0 
denotes  the  increasing  limit  of  At  as  t  ->  oo  (this  limit  always  exists  in  [0,  oo]). 

Theorem  4.13  Let  M  be  a  continuous  local  martingale  with  Mo  G  L  . 

(i)  The  following  are  equivalent: 

(a)  M  is  a  (true)  martingale  bounded  in  L2. 

(b)  E[(M,M) oo]  <  oo. 

Furthermore,  if  these  properties  hold,  the  process  M2  —  (M,  M)t  is  a  uniformly 
integrable  martingale,  and  in  particular  7s[M^J  =  E[Mq\  +  E[(M,  M)^]. 

(ii)  The  following  are  equivalent: 

(a)  M  is  a  (true)  martingale  and  Mt  G  L2  for  every  t  >  0. 

(b)  E[(M,M)t\  <  oo  for  every  t  >  0. 

Furthermore,  if  these  properties  hold,  the  process  M2  —  (M,  M)t  is  a  martingale. 

Remark  In  property  (a)  of  (i)  (or  of  (ii)),  it  is  essential  to  suppose  that  M  is  a 
martingale,  and  not  only  a  continuous  local  martingale.  Doob’s  inequality  used  in 
the  following  proof  is  not  valid  in  general  for  a  continuous  local  martingale! 

Proof 

(i)  Replacing  M  by  M  —  Mo,  we  may  assume  that  Mo  =  0  in  the  proof.  Let  us 
first  assume  that  M  is  a  martingale  bounded  in  L2 .  Doob’s  inequality  in  L2 
(Proposition  3.15  (ii))  shows  that,  for  every  T  >  0, 


sup  M: 

L0  <t<T 


<  4E[Mj\. 


By  letting  T  go  to  oo,  we  have 


sup  Mf 

L  t>  o 


<  4sup£[M2]  —\C<oo. 

t>  0 
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Set  Sn  —  inf{t  >  0  :  ( M,M)t  >  n}.  Then  the  continuous  local  martingale 
MjAS/  —  (M,  M)tAsn  is  dominated  by  the  variable 

sup  M2  +  n, 

s>  0 


which  is  integrable.  From  Proposition  4.7  (ii),  we  get  that  this  continuous  local 
martingale  is  a  uniformly  integrable  martingale,  hence 


E[(M,  M)tAs„]  =  E[M2]  <  E 


sup  M; 

,v>0 


<  C. 


By  letting  n ,  and  then  t  tend  to  infinity,  and  using  monotone  convergence,  we  get 
E[(M,M) oo]  <  C<  oo. 

Conversely,  assume  that  £'[(M,M)00]  <  oo.  Set  Tn  —  inf{t  >  0  :  |M^|  >  n}. 
Then  the  continuous  local  martingale  Mr2Ar  —  (M,M)tATn  is  dominated  by  the 
variable 


n2  +  (M,M)  oo, 

which  is  integrable.  From  Proposition  4.7  (ii)  again,  this  continuous  local 
martingale  is  a  uniformly  integrable  martingale,  hence,  for  every  t  >  0, 

E[M2aT J  -  E[{M,M)tATn]  <  £[(M,M)oo]  =:  C'  <  oo. 

By  letting  n  ->  oo  and  using  Fatou’s  lemma,  we  get  E[Mj]  <  C',  so  that  the 
collection  (Mr)r>0  is  bounded  in  L2.  We  have  not  yet  verified  that  (Mt)t> o  is  a 
martingale.  However,  the  previous  bound  on  £[M2Ar  ]  shows  that  the  sequence 

(MtATn)n> i  is  uniformly  integrable,  and  therefore  converges  both  a.s.  and  in  L1  to 
Mu  for  every  t  >  0.  Recalling  that  MTn  is  a  martingale  (Proposition  4.7  (iii)),  the 
L1 -convergence  allows  us  to  pass  to  the  limit  n  ->  oo  in  the  martingale  property 
|  J^s.]  =  MsaTii,  for  0  <  s  <  t,  and  to  get  that  M  is  a  martingale. 

Finally,  if  properties  (a)  and  (b)  hold,  the  continuous  local  martingale  M 2  — 
( M ,  M)  is  dominated  by  the  integrable  variable 

supM2  +  (M,M)  oo 

r>0 

and  is  therefore  (by  Proposition  4.7  (ii))  a  uniformly  integrable  martingale. 

(ii)  It  suffices  to  apply  (i)  to  (MtAa)t> o  for  every  choice  of  a  >  0.  □ 
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4.4  The  Bracket  of  Two  Continuous  Local  Martingales 

Definition  4.14  If  M  and  N  are  two  continuous  local  martingales,  the  bracket 
( M ,  A}  is  the  finite  variation  process  defined  by  setting,  for  every  t  >  0, 

(M,  AO,  =  ^  ((M  +  TV,  M  +  N)t  -  (M,  M),  -  (N,  N)t). 

Let  us  state  a  few  easy  properties  of  the  bracket. 

Proposition  4.15 

(i)  ( M,N )  is  the  unique  (up  to  indistinguishability)  finite  variation  process  such 
that  MtNt  —  ( M,N)t  is  a  continuous  local  martingale. 

(ii)  The  mapping  (M,  N)  i->  (M,  N)  is  bilinear  and  symmetric. 

(iii)  If  0  =  t^  <  f l  <  mmm  <  1pn  —  t  is  an  increasing  sequence  of  subdivisions  of 
[0,  t\  with  mesh  tending  to  0,  we  have 


Pn 

lim  Y(M,n  -MfiVNfi  =  (M,  N), 

n — >oo  ‘  ^  1  1  1  1  11 

i=l 


in  probability. 

(iv)  For  every  stopping  time  T,  ( MT,NT)t  —  ( MT,N)t  =  ( M,N)tAp . 

(v)  IfM  and  N  are  two  martingales  ( with  continuous  sample  paths)  bounded  in  L2, 
MtNt  —  ( M,N)t  is  a  uniformly  integrable  martingale.  Consequently,  (M,N)oo 
is  well  defined  as  the  almost  sure  limit  of  ( M ,  N)t  as  t  —>  oo,  is  integrable,  and 
satisfies 


EIM^Noo]  =  E[M0N0]  +  E[(M,N)oo\. 

Proof  (i)  follows  from  the  analogous  characterization  in  Theorem  4.9  (uniqueness 
follows  from  Theorem  4.8).  Similarly  (iii)  is  a  consequence  of  the  analogous 
assertion  in  Theorem  4.9.  (ii)  follows  from  (iii),  or  can  be  proved  directly  via  the 
uniqueness  argument.  We  can  then  get  (iv)  as  a  consequence  of  property  (iii),  noting 
that  this  property  implies,  for  every  0  <  s  <  t,  a.s., 

(. Mt,Nt),  =  ( MT,N)t  =  (M,N)t  on  {T  >  t }, 

(. MT,NT)t  -  { Mt,Nt)s  =  { Mt,N),  -  { Mt,N)s  =  0  on  {T  <  s  <  t). 

Finally,  (v)  follows  as  a  consequence  of  Theorem  4. 13  (i).  □ 

Remark  A  consequence  of  (iv)  is  the  fact  that  MT(N  —  NT )  is  a  continuous  local 
martingale,  which  is  not  so  easy  to  prove  directly. 
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Proposition  4.16  Let  B  and  B'  be  two  independent  (J^)- -Brownian  motions.  Then 
(B,  B')t  =  Ofor  every  t  >  0. 

Proof  By  subtracting  the  initial  values,  we  may  assume  that  Bo  —  B'0  —  0.  We  then 
observe  that  the  process  Xt  =  ( Bt  +  B[)  is  a  martingale,  as  a  linear  combination 

of  martingales.  By  checking  the  finite-dimensional  marginals  of  X ,  we  verify  that  X 
is  also  a  Brownian  motion  (notice  that  we  do  not  claim  that  X  is  an  -Brownian 
motion).  Proposition  2.16  implies  that  ( X,X)t  —  t,  and,  using  the  bilinearity  of  the 
bracket,  it  follows  that  (B,  B')t  =  0.  □ 

Definition  4.17  Two  continuous  local  martingales  M  and  N  are  said  to  be  orthogo¬ 
nal  if  (M,  N)  —  0,  which  holds  if  and  only  if  MN  is  a  continuous  local  martingale. 

In  particular,  two  independent  (J^)- -Brownian  motions  are  orthogonal  martin¬ 
gales,  by  Proposition  4.16. 

If  M  and  N  are  two  orthogonal  martingales  bounded  in  L2,  we  have  E[MtNt\  — 
E[MqNo ],  and  even  E[MsNs]  —  E[MqNo]  for  any  stopping  time  S.  This  follows  from 
Theorem  3.22,  using  property  (v)  of  Proposition  4.15. 

Proposition  4.18  (Kunita-Watanabe)  Let  M  and  N  be  two  continuous  local 
martingales  and  let  H  and  K  be  two  measurable  processes.  Then,  a.s., 


Ks\\d(M,N)s 


< 


Proof  Only  in  this  proof,  we  use  the  special  notation  (M,  N)*s  —  ( M ,  N)t  —  ( M ,  N)s 
for  0  <  s  <  t.  The  first  step  of  the  proof  is  to  observe  that  we  have  a.s.  for  every 
choice  of  the  rationals  s  <  t  (and  also  by  continuity  for  every  reals  s  <  t). 


\(m,nys\  <  ffEMYs  fWXys. 

Indeed,  this  follows  from  the  approximations  of  ( M,M )  and  ( M,N )  given  in 
Theorem  4.9  and  in  Proposition  4.15  respectively  (note  that  these  approximations 
are  easily  extended  to  the  increments  of  ( M,M )  and  (. M,N )),  together  with  the 
Cauchy-Schwarz  inequality.  From  now  on,  we  fix  co  such  that  the  inequality  of  the 
last  display  holds  for  every  s  <  t,  and  we  argue  with  this  value  of  co  (the  remaining 
part  of  the  argument  is  “deterministic”). 

We  then  observe  that  we  also  have,  for  every  0  <  s  <  t, 

(  \d(M,  N)u\  < 

J  S 


(4.7) 
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Indeed,  we  use  Proposition  4.2,  noting  that,  for  any  subdivision  s  —  to  <  t\  <  •  •  •  < 
tp  —  t,  we  can  bound 


J2 1  <M’  *> 

i=  1 


U 

U — l 


u 

U — i 


/=  1  /=  1 

=  /(AUVH- 


We  then  get  that,  for  every  bounded  Borel  subset  A  of  R+, 


d(N,N)u. 


When  A  =  [s,  t],  this  is  the  bound  (4.7).  If  A  is  a  finite  union  of  intervals,  this  follows 
from  (4.7)  and  another  application  of  the  Cauchy-Schwarz  inequality.  A  monotone 
class  argument  shows  that  the  inequality  of  the  last  display  remains  valid  for  any 
bounded  Borel  set  A  (here  we  use  a  version  of  the  monotone  class  lemma  that  is 
different  from  the  one  in  Appendix  Al:  precisely,  a  class  of  sets  which  is  stable 
under  increasing  and  decreasing  sequential  limits  and  which  contains  an  algebra  of 
sets  must  contain  the  a -field  generated  by  this  algebra  -  see  the  first  chapter  of  [64]). 

Next  let  h  —  Yl?i=\  A/Ia,-  and  k  =  Yfi=i  be  two  nonnegative  simple 
functions  on  M+  with  bounded  support  contained  in  [0,  K\,  for  some  K  >  0.  Here 
Ai , . . . ,  Ap  is  a  measurable  partition  of  [0,  K\,  and  Ai , . . . ,  Xp,  ji\ , . . . ,  jip  are  reals 
(we  can  always  assume  that  h  and  k  are  expressed  in  terms  of  the  same  partition). 
Then, 


p 

h(s)k(s)\d(M ,  N) s\  =  Y:  XjjXj 

i=  1 

s  f 

i=  1  ^Ai  i=  1  ''A 

1/2  n  1/2 

h(s)2d(M,M)s)  (  /  k(s)2d{N,N)sj  , 

which  gives  the  desired  inequality  for  simple  functions.  Since  every  nonnegative 
Borel  function  is  a  monotone  increasing  limit  of  simple  functions  with  bounded 
support,  an  application  of  the  monotone  convergence  theorem  completes  the  proof. 

□ 
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4.5  Continuous  Semimartingales 

We  now  introduce  the  class  of  processes  for  which  we  will  develop  the  theory  of 
stochastic  integrals. 

Definition  4.19  A  process  X  —  (Xt)t>o  is  a  continuous  semimartingale  if  it  can  be 
written  in  the  form 


Xt  —  Mt  +  At, 


where  M  is  a  continuous  local  martingale  and  A  is  a  finite  variation  process. 

The  decomposition  X  =  M  +  A  is  then  unique  up  to  indistinguishability  thanks 
to  Theorem  4.8.  We  say  that  this  is  the  canonical  decomposition  of  X. 

By  construction,  continuous  semimartingales  have  continuous  sample  paths.  It  is 


possible  to  define  a  notion  of  semimartingale  with  cadlag  sample  paths,  but  in  this 


book,  we  will  only  deal  with  continuous  semimartingales,  and  for  this  reason  we 
sometimes  omit  the  word  continuous. 

Definition  4.20  Let  X  —  M  +  A  and  Y  —  M'  +  A'  be  the  canonical  decompositions 
of  two  continuous  semimartingales  X  and  Y.  The  bracket  (X,  Y)  is  the  finite  variation 
process  defined  by 


(X,Y)t=  (M,M')t. 


In  particular,  we  have  ( X ,  X)t  =  ( M ,  M)t. 

Proposition  4.21  Let  0  =  tg  <  f[  <  •  •  •  <  ^  —  t  be  an  increasing  sequence  of 
subdivisions  of[ 0,  t\  whose  mesh  tends  to  0.  Then, 


Pn 


in  probability. 

Proof  We  treat  the  case  where  X  —  Y  and  leave  the  general  case  to  the  reader. 
We  have 
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By  Theorem  4.9, 


Pn 


lim  V(M,«  -M^)2  =  =  {X,X)t, 

n—>oo  •  J  1  1  1 


i=l 


in  probability.  On  the  other  hand, 


Pn  Pn 

y ”_,}2  <  (  sup  |A,»  -  |A*»  -A,«_1 

i=l  i=i 

't 


< 


(  f  |dA5|)  sup  |A,«  —  Af}_x 

0  ^  1  <i<pn 


which  tends  to  0  a.s.  when  n  ->  oo  by  the  continuity  of  sample  paths  of  A.  The  same 
argument  shows  that 


Pn 


-Atn_x)(Mfi 


i=  1 


< 


sup  |M?« 
!<«</?« 


tends  to  0  a.s. 


□ 


Exercises 

In  the  following  exercises,  processes  are  defined  on  a  probability  space  (X2,  ^ ,P) 
equipped  with  a  complete  filtration  o,oo]- 

Exercise  4.22  Let  U  be  an  -measurable  real  random  variable,  and  let  M  be  a 
continuous  local  martingale.  Show  that  the  process  Nt  —  UMt  is  a  continuous  local 
martingale.  ( This  result  was  used  in  the  construction  of  the  quadratic  variation  of  a 
continuous  local  martingale.) 

Exercise  4.23 

1.  Let  M  be  a  (true)  martingale  with  continuous  sample  paths,  such  that  Mo  =  0.  We 
assume  that  (Mt)t> o  is  also  a  Gaussian  process.  Show  that,  for  every  t  >  0  and 
every  s  >  0,  the  random  variable  Mt+S  —  Mt  is  independent  of  cr(Mr,  0  <  r  <  t). 

2.  Under  the  assumptions  of  question  L,  show  that  there  exists  a  continuous 
monotone  nondecreasing  function/  :  M+  ->  M+  such  that  ( M,M)t  =  f(t) 
for  every  t  >  0. 
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Exercise  4.24  Let  M  be  a  continuous  local  martingale  with  Mo  =  0. 

1.  For  every  integer  n  >  1,  we  set  Tn  —  inf{t  >  0  :  \Mt\  =  n}.  Show  that,  a.s. 


oo 


lim  Mt  exists  and  is  finite j  =  =  oo}  C  {(M,  M}^  <  oo}, 


n=  1 


2.  We  set  Sn  =  inf {t  >  0  :  (M,  M)t  —  n}  for  every  n  >  1.  Show  that,  a.s., 


oo 

{{M,M) oo  <  oo}  =  C|{S„  =  oo}  C 

n=  1 


lim  Mt  exists  and  is  finite 

f->oo 


and  conclude  that 

<  lim  Mt  exists  and  is  finite)  =  {(M,  M}^  <  oo}  ,  a.s. 

I  t — >"00  j 

Exercise  4.25  For  every  integer  n  >  1,  let  Mn  =  (M")?> o  be  a  continuous  local 
martingale  with  Mq  =  0.  We  assume  that 


lim  {Mn,Mn) oo  =  0 

n—>oo 

in  probability. 

1.  Let  £  >  0,  and,  for  every  n  >  1,  let 

Tns  =  inf {f  >  0  :  (. Mn,Mn)t  >  e}. 

Justify  the  fact  that  Tn£  is  a  stopping  time,  then  prove  that  the  stopped  continuous 
local  martingale 


—  Mn 

~  iyit/\T»  ’ 


Vf  >  0  , 


is  a  true  martingale  bounded  in  L2. 
2.  Show  that 


sup  |MJV 

L  P>0 


<  4e2. 


3.  Writing,  for  every  a  >  0, 


P^sup  |M”|  >  a)  <  P^sup  \M''’S\  >  a )  +  P(P"  <  oo), 
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show  that 

lim  (  sup  \Mf\)  —  0 

n->oc  \  / 

in  probability. 

Exercise  4.26 

1.  Let  A  be  an  increasing  process  (adapted,  with  continuous  sample  paths  and  such 
that  A0  =  0)  such  that  Aoo  <  oo  a.s.,  and  let  Z  be  an  integrable  random  variable. 
We  assume  that,  for  every  stopping  time  T , 

£[^oo  —  At\  <  E[Z  l{r<oo}]« 

Show,  by  introducing  an  appropriate  stopping  time,  that,  for  every  A  >  0, 

E[(Aoo  —  A)  1{Aoo>a}]  ^  E[Z  1{Aoo>a}]- 

2.  Let /  :  M+  — >  R  be  a  continuously  differentiable  monotone  increasing  function 
such  that/(0)  =  0  and  set  F(x)  —  /0l/(t)d t  for  every  v  >  0.  Show  that,  under 
the  assumptions  of  question  L,  one  has 

E[F(Aoo)]  <  E[Zf(AOQ)]. 

(Hint:  It  may  be  useful  to  observe  that  F(x)  —  xf(x)  —  Xf' (A)  dX  for  every 

v  >  0.) 

3.  Let  M  be  a  (true)  martingale  with  continuous  sample  paths  and  bounded  in  L 2 
such  that  Mo  =  0,  and  let  M^  be  the  almost  sure  limit  of  Mt  as  t  — >  oo.  Show 
that  the  assumptions  of  question  1.  hold  when  At  —  ( M ,  M)t  and  Z  =  M^.  Infer 
that,  for  every  real  q  >  1 , 

E[((M,M)oo)q+l]  <  (q+l)E[({M,M)00)‘>M200]. 

4.  Let  p  >  2  be  a  real  number  such  that  E[((M,  M)0 o)p]  <  oo.  Show  that 

E[({M,M)0O)p]  <  pP  E^M00\2p], 

5.  Let  N  be  a  continuous  local  martingale  such  that  —  0,  and  let  T  be  a  stopping 
time  such  that  the  stopped  martingale  NT  is  uniformly  integrable.  Show  that,  for 
every  real  p  >  2, 


E[((N,N)tY]  <  ff  E[\Nt\2?]. 

Give  an  example  showing  that  this  result  may  fail  if  NT  is  not  uniformly 
integrable. 
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Exercise  4.27  Let  (Xt)t>o  be  an  adapted  process  with  continuous  sample  paths 
and  taking  nonnegative  values.  Let  (At)t> o  be  an  increasing  process  (adapted, 
with  continuous  sample  paths  and  such  that  A0  =  0).  We  consider  the  following 
condition: 

(D)  For  every  bounded  stopping  time  T ,  we  have  E[XT]  <  E[AT]. 

1.  Show  that,  if  M  is  a  square  integrable  martingale  with  continuous  sample  paths 
and  Mo  =  0,  the  condition  (D)  holds  for  Xt  —  Mf  and  At  —  (M,  M)t. 

2.  Show  that  the  conclusion  of  the  previous  question  still  holds  if  one  only  assumes 
that  M  is  a  continuous  local  martingale  with  Mo  =  0. 

3.  We  set  X*  =  sup^X?.  Show  that,  under  the  condition  (D),  we  have,  for  every 
bounded  stopping  time  S  and  every  c  >  0, 


P(X*s  >c)<  -E[AS], 

c 

(Hint:  One  may  apply  (D)  to  T  —  S  A  R,  where  R  =  inf{t  >  0  :  Xt  >  c}.) 

4.  Infer  that,  still  under  the  condition  (D),  one  has,  for  every  (finite  or  not)  stopping 
time  S , 


P(X*S  >  c)  <  -E[AS] 

c 

(when  S  takes  the  value  oo,  we  of  course  define  X ^  =  supy>0X5). 

5.  Let  c  >  0  and  d  >  0,  and  S  =  inf{t  >  0  :  At  >  d}.  Let  T  be  a  stopping  time. 
Noting  that 


{XT  >  c}  C  >  c}  U  {At  >  d 

show  that,  under  the  condition  (D),  one  has 


P(X*  >  c)  <  -E[At  a  d]  +  P(At  >  d). 

c 

6.  Use  questions  (2)  and  (5)  to  verify  that,  if  M^  is  a  sequence  of  continuous 
local  martingales  and  T  is  a  stopping  time  such  that  {M^n\  M^)t  converges  in 
probability  to  0  as  n  — >  oo,  then, 


lim 

n—>o o 


in  probability. 
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Notes  and  Comments 

The  book  [14]  of  Dellacherie  and  Meyer  is  again  an  excellent  reference  for 
the  topics  of  this  chapter,  in  the  more  general  setting  of  local  martingales  and 
semimartingales  with  cadlag  sample  paths.  See  also  [72]  and  [49]  (in  particular, 
a  discussion  of  the  elementary  theory  of  finite  variation  processes  can  be  found  in 
[72]).  The  notion  of  a  local  martingale  appeared  in  Ito  and  Watanabe  [43]  in  1965. 
The  notion  of  a  semimartingale  seems  to  be  due  to  Fisk  [25]  in  1965,  who  used 
the  name  “quasimartingales”.  See  also  Meyer  [60].  The  classical  approach  to  the 
quadratic  variation  of  a  continuous  (local)  martingale  is  based  on  the  Doob-Meyer 
decomposition  theorem  [58],  see  e.g.  [49].  Our  more  elementary  presentation  is 
inspired  by  [70]. 


Chapter  5 

Stochastic  Integration 


This  chapter  is  at  the  core  of  the  present  book.  We  start  by  defining  the  stochastic 
integral  with  respect  to  a  square-integrable  continuous  martingale,  considering  first 
the  integral  of  elementary  processes  (which  play  a  role  analogous  to  step  functions 
in  the  theory  of  the  Riemann  integral)  and  then  using  an  isometry  between  Hilbert 
spaces  to  deal  with  the  general  case.  It  is  easy  to  extend  the  definition  of  stochastic 
integrals  to  continuous  local  martingales  and  semimartingales.  We  then  derive  the 
celebrated  Ito’s  formula,  which  shows  that  the  image  of  one  or  several  continuous 
semimartingales  under  a  smooth  function  is  still  a  continuous  semimartingale, 
whose  canonical  decomposition  is  given  in  terms  of  stochastic  integrals.  Ito’s 
formula  is  the  main  technical  tool  of  stochastic  calculus,  and  we  discuss  several 
important  applications  of  this  formula,  including  Levy’s  theorem  characterizing 
Brownian  motion  as  a  continuous  local  martingale  with  quadratic  variation  process 
equal  to  t ,  the  Burkholder-Davis-Gundy  inequalities  and  the  representation  of 
martingales  as  stochastic  integrals  in  a  Brownian  filtration.  The  end  of  the  chapter 
is  devoted  to  Girsanov’s  theorem,  which  deals  with  the  stability  of  the  notions 
of  a  martingale  and  a  semimartingale  under  an  absolutely  continuous  change  of 
probability  measure.  As  an  application  of  Girsanov’s  theorem,  we  establish  the 
famous  Cameron-Martin  formula  giving  the  image  of  the  Wiener  measure  under 
a  translation  by  a  deterministic  function. 


5.1  The  Construction  of  Stochastic  Integrals 

Throughout  this  chapter,  we  argue  on  a  filtered  probability  space  (£2,^,  (#)),  P), 
and  we  assume  that  the  filtration  (#))  is  complete.  Unless  otherwise  specified,  all 
processes  in  this  chapter  are  indexed  by  M+  and  take  real  values.  We  often  say 
“continuous  martingale”  instead  of  “martingale  with  continuous  sample  paths”. 
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5.1.1  Stochastic  Integrals  for  Martingales  Bounded  in  L2 

We  write  H2  for  the  space  of  all  continuous  martingales  M  which  are  bounded  in 
L2  and  such  that  Mo  =  0,  with  the  usual  convention  that  two  indistinguishable 
processes  are  identified.  Equivalently,  M  e  H2  if  and  only  if  M  is  a  continuous 
local  martingale  such  that  Mo  —  0  and  E[(M,M) oo]  <  oo  (Proposition  4.13).  By 
Proposition  3.21,  if  M  e  M2,  we  have  Mt  —  E[M0 0  |  TPt\  where  M ^  e  L 2  is  the 
almost  sure  limit  of  Mt  as  t  ->  oo. 

Proposition  4.15  (v)  shows  that,  if  M,N  e  M2,  the  random  variable  (M,  A/^oo  is 
well  defined,  and  we  have  E[\  (M,  N)^  |]  <  oo.  This  allows  us  to  define  a  symmetric 
bilinear  form  on  H2  via  the  formula 

(M,A0e2  =  E[(M,N)oo\  =  E[MooNoo], 

where  the  second  equality  comes  from  Proposition  4.15  (v).  Clearly  (M,  M)e 2  =  0 
if  and  only  if  M  —  0.  The  scalar  product  (M,  N)M2  thus  yields  a  norm  on  H2  given 
by 


\\M\\B2  =  (M,M)^22  =  E[(M,M)oc]l/2  =  ^[(Moo)2]1/2. 

Proposition  5.1  The  space  H2  equipped  with  the  scalar  product  (M,N)$p.  is  a 
Hilbert  space. 

Proof  We  need  to  verify  that  the  vector  space  H2  is  complete  for  the  norm  ||  \\M2. 
Let  (Mn)n>  1  be  a  sequence  in  H2  which  is  Cauchy  for  that  norm.  We  have  then 

lim  E[(M^  -  M^)2]  =  lim  (Mn  -  Mm,  Mn  -  Mm)m 2  =  0. 

m,n—>oo  m,n— >oo 

Consequently,  the  sequence  (M7^)  converges  in  L 2  to  a  limit,  which  we  denote 
by  Z.  On  the  other  hand,  Doob’s  inequality  in  L 2  (Proposition  3.15  (ii))  and  a 
straightforward  passage  to  the  limit  show  that,  for  every  m,  n. 


m\2 


sup  (M”  -  Mf  ) 

t>0 


<  4  £[(M; 


n 

00 


We  thus  obtain  that 


lim  E 

m.n^oo 


sup  (M\ 

L  r>  0 


(5.1) 


Hence,  for  every  t  >  0,  Mnt  converges  in  L2,  and  we  want  to  argue  that  the  limit 
yields  a  process  with  continuous  sample  paths.  To  this  end,  we  use  (5.1)  to  find  an 
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increasing  sequence  %  f  o o  such  that 


oo 


< 


y>[  sup(/W"'  -  M\ 
k=  i  L 


Kk+ 1)2 


1 1/2 


<  OO. 


The  last  display  implies  that,  a.s., 


oo 


E 


sup  I M"k  -  Mn,k+' 

t>  0 


<  oo, 


and  thus  the  sequence  (M^)r>0  converges  uniformly  on  R+,  a.s.,  to  a  limit  denoted 
by  (Mt)t>Q.  On  the  zero  probability  set  where  the  uniform  convergence  does  not 
hold,  we  take  Mt  —  0  for  every  t  >  0.  Clearly  the  limiting  process  M  has  continuous 
sample  paths  and  is  adapted  (here  we  use  the  fact  that  the  filtration  is  complete). 
Furthermore,  from  the  L2 -convergence  of  (M^)  to  Z,  we  immediately  get  by  passing 
to  the  limit  in  the  identity  M"k  —  E[M[ ^  |  &t\  that  Mt  =  E[Z  \  JPt\.  Hence  (Mt)t>o 
is  a  continuous  martingale  and  is  bounded  in  L2,  so  that  M  e  M2.  The  a.s.  uniform 
convergence  of  (M^)?>0  to  (Mt)t> o  then  ensures  that  M ^  =  limMJ^  =  Z  a.s. 
Finally,  the  L2 -convergence  of  (M7^)  to  Z  =  M ^  shows  that  the  sequence  ( Mn ) 
converges  to  M  in  M2.  □ 

We  denote  the  progressive  a-field  on  £2  x  M+  by  (see  the  end  of  Sect.  3.1), 
and,  if  M  g  i2,  we  let  L2(M)  be  the  set  of  all  progressive  processes  H  such  that 


<  oo, 


with  the  convention  that  two  progressive  processes  H  and  EL’  satisfying  this 
integrability  condition  are  identified  if  Hs  —  H's  =  0,  d (M,M)S  a.e.,  a.s.  We  can 
view  L 2  (M)  as  an  ordinary  L 2  space,  namely 


L2(M)  =  E2(Q  X  R_|_,  2Z ,  dPd(M,  M)s) 


where  dPd(M,  M)s  refers  to  the  finite  measure  on  (X2  x  R+,  gZ)  that  assigns  the 
mass 


'OO 


e\  [  lA(0),s)d(M,M)i 

1  Jo 


to  a  set  A  e  SZ  (the  total  mass  of  this  measure  is  E[(M,  M) oo]  =  ||M||^2) . 

Just  like  any  L 2  space,  the  space  L 2  (M)  is  a  Hilbert  space  for  the  scalar  product 


(H,K)L  2(M)  — 


=E  / 
LJ  0 


HSKS  d(M,M) 
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and  the  associated  norm  is 


/  r  f°°  i  n\ i/2 

H\\L2(M)  =  (e[  J  H*d(M,M)s\) 


Definition  5.2  An  elementary  process  is  a  progressive  process  of  the  form 


p-i 

Hs(a>)  =  T,  H(j )  (ft>)  l(Wj+1](V), 

i=0 


where  0  —  to  <  t\  <  ^  <  •  •  •  <  tp  and  for  every  i  e  {0,  1, ...  ,p  —  1},  H(p  is  a 
bounded  -measurable  random  variable. 

The  set  £  of  all  elementary  processes  forms  a  linear  subspace  of  L2(M).  To  be 
precise,  we  should  here  say  ‘‘equivalence  classes  of  elementary  processes”  (recall 
that  H  and  Hr  are  identified  in  L2{M)  if  \\H  —  H'  ||l2(m)  —  0). 

Proposition  5.3  For  every  M  e  M2,  £  is  dense  in  L2(M). 

Proof  By  elementary  Hilbert  space  theory,  it  is  enough  to  verify  that,  if  K  e  L2(M) 
is  orthogonal  to  £,  then  K  —  0.  Assume  that  K  e  L2(M)  is  orthogonal  to  and  set, 
for  every  t  >  0, 


X,=  [  Kud(M,M)u. 

Jo 

To  see  that  the  integral  in  the  right-hand  side  makes  sense,  and  defines  a  finite 
variation  process  (Xt)t>o,  we  use  the  Cauchy-Schwarz  inequality  to  observe  that 


Ku  \  d(M,  M)u 


< 


(e[[  (Ku)2d{M,M)u]) 


1\  1/2 


x  (E[(M,M) oo])1/2- 


The  right-hand  side  is  finite  since  M  e  M2  and  K  e  L2{M),  and  thus  we  have  in 
particular 


a.s.  Wt  >  0, 


Ku\  d(M,M)u  <  oo. 


By  Proposition  4.5  (and  Remark  (i)  following  this  proposition),  (Xt)t>o  is  well 
defined  as  a  finite  variation  process.  The  preceding  bound  also  shows  that  Xt  e  L1 
for  every  t  >  0. 

LetO  <  <  t,  let  F  be  a  bounded  ^-measurable  random- variable,  and  let  H  e  £ 

be  the  elementary  process  defined  by  Hr{co)  —  F(oo)  l(s^](r).  Writing  ( H ,  K)L2^M)  = 
0,  we  get 


E 
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It  follows  that  E[F(Xt  —  2Q]  =  0  for  every  s  <  t  and  every  bounded  ^-measurable 
variable  F.  Since  the  process  X  is  adapted  and  we  know  that  Xr  e  Ll  for  every  r  >  0, 
this  implies  that  X  is  a  (continuous)  martingale.  On  the  other  hand,  X  is  also  a  finite 
variation  process  and,  by  Theorem  4.8,  this  is  only  possible  if  X  —  0.  We  have  thus 
proved  that 


t 

Ku  d(M,M)u  —  0  W>0,  a.s. 

which  implies  that,  a.s.,  the  signed  measure  having  density  Ku  with  respect  to 
d(M,  M)u  is  the  zero  measure,  which  is  only  possible  if 

Ku  =  0,  d(M,  M)u  a.e.,  a.s. 

or  equivalently  K  —  0  in  L2(M).  □ 

Recall  our  notation  XT  for  the  process  X  stopped  at  the  stopping  time  T:  Xf  — 
XtAT.  If  Me  H2,  the  fact  that  (MT,MT)0 0  =  ( M,M)t  immediately  implies  that 
Mt  also  belongs  to  H2.  Furthermore,  if  H  e  L?(M ),  the  process  l[o  t]H  defined 
by  (l[o,7]  H)s((o)  =  1  (o <s<t((i))}Hs(cjo)  also  belongs  to  L2(M)  (note  that  1[o,t]  is 
progressive  since  it  is  adapted  with  left-continuous  sample  paths). 

Theorem  5.4  Let  M  e  M2.  For  every  H  e  S’  of  the  form 


p- i 

hs(co)  =  i(,,,?j+1]d), 

1=0 

the  formula 


p- 1 

(H  ■  M),  =  J2  H{ 0  (Mti+lAt  -  MtiAt) 

1=0 

defines  a  process  H  -M  G  M2.  The  mapping  H  i-^  H  •  M  extends  to  an  isometry  from 
L2(M)  into  H2.  Furthermore,  H  •  M  is  the  unique  martingale  ofM2  that  satisfies  the 
property 


(H  •  M,N)  —  H  •  (M,  N) ,  VN  e  HI2.  (5.2) 

IfT  is  a  stopping  time,  we  have 


(1[0  j]H)  •  M  =  (H  •  M)t  —  H  •  MT . 


(5.3) 
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We  often  use  the  notation 


(H  •  M)t  —  [  HsdMs 
Jo 

and  call  H  •  M  the  stochastic  integral  ofH  with  respect  to  M. 

Remark  The  quantity  H  •  (M,  N)  in  the  right-hand  side  of  (5.2)  is  an  integral  with 
respect  to  a  finite  variation  process,  as  defined  in  Sect.  4.1.  The  fact  that  we  use  a 
similar  notation  H  •  A  and  H  •  M  for  the  integrals  with  respect  to  a  finite  variation 
process  A  and  with  respect  to  a  martingale  M  creates  no  ambiguity  since  these  two 
classes  of  processes  are  essentially  disjoint. 

Proof  As  a  preliminary  observation,  we  note  that  the  definition  of  H  •  M  when 
H  e  $  does  not  depend  on  the  decomposition  chosen  for  H  in  the  first  display  of 
the  theorem.  Using  this  remark,  one  then  checks  that  the  mapping  H  i->  H  •  M  is 
linear.  We  next  verify  that  this  mapping  is  an  isometry  from  $  (viewed  as  a  subspace 
of  L2(M))  into  H2. 

Fix  H  e  (o  of  the  form  given  in  the  theorem,  and  for  every  i  e  {0,1 1}, 
set 


T/]  —  H(i)  (Mti+lAt  —  MtjAt), 

for  every  t  >  0.  Then  a  simple  verification  shows  that  Ml  is  a  continuous  martingale 
(this  was  already  used  in  the  beginning  of  the  proof  of  Theorem  4.9),  and  that  this 
martingale  belongs  to  H2.  It  follows  that  H  •  M  —  Yfi=o  is  also  a  martingale 
in  H2.  Then,  we  note  that  the  continuous  martingales  Ml  are  orthogonal,  and  their 
respective  quadratic  variations  are  given  by 

(M‘,  M')t  =  -  (M,M)Ua,) 

(the  orthogonality  of  the  martingales  Ml  as  well  as  the  formula  of  the  last  display  are 
easily  checked,  for  instance  by  using  the  approximations  of  ( M,N }).  We  conclude 
that 


p— i 

(H-M,H-M)t  =  J2  HfdiM,  M)ti+lAt 

i= 0 


H2S  &(M,M)S. 


Consequently, 


\\H-M\\2nl 


=  E[(H  -M,H  ■  M)oo]  =  E 


H2d(M,M)s 


2 

L2(M)’ 


By  linearity,  this  implies  that  H-M  —  H'  -M  if  H'  is  another  elementary  process  that 
is  identified  with  H  in  L2(M).  Therefore  the  mapping  H  h->  H  -M  makes  sense  from 
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£  viewed  as  a  subspace  of  L2(M)  into  H2.  The  latter  mapping  is  linear,  and,  since 
it  preserves  the  norm,  it  is  an  isometry  from  £  (equipped  with  the  norm  of  L2(M)) 
into  H2.  Since  £  is  dense  in  L2(M)  (Proposition  5.3)  and  H2  is  a  Hilbert  space 
(Proposition  5.1),  this  mapping  can  be  extended  in  a  unique  way  to  an  isometry 
from  L2(M)  into  H2. 

Let  us  verify  property  (5.2).  We  fix  Af  G  H2.  We  first  note  that,  if  H  G  L2(M),  the 
Kunita-Watanabe  inequality  (Proposition  4.18)  shows  that 


Hs\\d(M,N)s 


—  \\h\\lHm)  ll^lle2  <  00 


and  thus  the  variable  /0°°  Hsd(M,N)s  —  (H  •  (M,N))oo  is  well  defined  and  in  L1. 
Consider  first  the  case  where  H  is  an  elementary  process  of  the  form  given  in  the 
theorem,  and  define  the  continuous  martingales  Ml ,  0  <  i  <  p  —  1 ,  as  previously. 
Then,  for  every  i  e  {0, 1, —  1}, 


p-  i 

(H  •  M,  N)  =  Y'W’W 

i= 0 


and  we  have 


(M',  N)t  =  H{i)((M,N)ti+lAt  -  (M,N)tiAt) 


It  follows  that 


p- 1 


(H-M,N)t  =  YH(»((M,N)ti+lAt-{M,N)tiAt)  =  f  Hs  d(M,N) 

Jo 


1=0 


which  gives  (5.2)  when  H  e  £ .  We  then  observe  that  the  linear  mapping  X  \-> 
( X ,  N)oq  is  continuous  from  M2  into  L1 .  Indeed,  by  the  Kunita-Watanabe  inequality, 

£[|(VA0oo|]  <  E[{X,X)00]1/2E[(N,N)00]1/2  =  ||iV||e2  ||X||h2. 

If  (//")„>  i  is  a  sequence  in  S’,  such  that  H"  —>  H  in  L2(M),  we  have  therefore 

(H  ■  M,N)oo  =  lim  (Hn  -M,N)oo  =  lim  (H"  ■  (M,  N})^  =  (H-(M,N))oo, 

n—>oo  n->oo 


where  the  convergences  hold  in  L1,  and  the  last  equality  again  follows  from  the 
Kunita-Watanabe  inequality  by  writing 


/ 


o° 

{Hns  -  Hs)  d (M,  N)s  <  £[(W,A^)oo]1/2  II Hn  -  H\\li(M) 
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We  have  thus  obtained  the  identity  (H  •  M,  N)oq  =  (H  •  (M,  N))oo,  but  replacing  N 
by  the  stopped  martingale  NT  in  this  identity  also  gives  (H  •  M,N)t  =  (H  •  (M,  N))t, 
which  completes  the  proof  of  (5.2). 

It  is  easy  to  see  that  (5.2)  characterizes  H  •  M  among  the  martingales  of  H2. 
Indeed,  if  X  is  another  martingale  of  H2  that  satisfies  the  same  identity,  we  get,  for 
every  N  £  H2, 


(H  •  M  —  X,N)  =  0. 


Taking  N  =  H  •  M  —  X  and  using  Proposition  4. 12  we  obtain  that  X  =  H  •  M. 

It  remains  to  verify  (5.3).  Using  the  properties  of  the  bracket  of  two  continuous 
local  martingales,  we  observe  that,  if  Af  e  H2, 

(( H-M)t,N ),  =  (H  ■  M,N)tAT  =  (H  ■  (M,N))tAT  =  (1  [0,t]  H  ■  (M ,  N))t 

which  shows  that  the  stopped  martingale  ( H-M)T  satisfies  the  characteristic  property 
of  the  stochastic  integral  (l[o  j\H)  • M .  The  first  equality  in  (5.3)  follows.  The  second 
one  is  proved  analogously,  writing 

(. H-Mt,N )  =  //•  {Mt,N)  —  H  •  (M,  N)t  =  l[0,r]  H  •  (M,N). 

This  completes  the  proof  of  the  theorem.  □ 

Remark  We  could  have  used  the  relation  (5.2)  to  define  the  stochastic  integral  H-M , 
observing  that  the  mapping  N  i-^  E[(H  •  (M,  A0)oo]  yields  a  continuous  linear  form 
on  H2,  and  thus  there  exists  a  unique  martingale  H-M  in  H2  such  that 

E[(H.(M,N))  oo]  =  2  =E[(H-M,N)  00]. 

Using  the  notation  introduced  at  the  end  of  Theorem  5.4,  we  can  rewrite  (5.2)  in 
the  form 


HsdMs, 


Hs  d(M,N)s. 


We  interpret  this  by  saying  that  the  stochastic  integral  “commutes”  with  the  bracket. 
Let  us  immediately  mention  a  very  important  consequence.  If  M  £  H2,  and  H  £ 
L2(M),  two  successive  applications  of  (5.2)  give 


(H  •  M,H  -  M)  =  H  •  (H  •  (M,  M))  =  H 2  •  (AT,  AT), 
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using  the  “associativity  property”  (4.1)  of  integrals  with  respect  to  finite  variation 
processes.  Put  differently,  the  quadratic  variation  of  the  continuous  martingale 
//•Mis 


HsdMs),  =  [  H2  d(M,M)s.  (5.4) 

Jo 

More  generally,  if  N  is  another  martingale  of  H2  and  K  £  L?(N ),  the  same  argument 
gives 


KsdNs)t=  [  HSKS  d(M,N)s.  (5.5) 

Jo 

The  following  “associativity”  property  of  stochastic  integrals,  which  is  analogous 
to  property  (4.1)  for  integrals  with  respect  to  finite  variation  processes,  is  very 
useful. 

Proposition  5.5  Let  H  £  L2{M).  If  K  is  a  progressive  process ,  we  have  KH  £ 
L2{M)  if  and  only  ifK  £  L2{H  •  M).  If  the  latter  properties  hold, 

{KH)  •  M  =  K  •  (H  •  M). 


Proof  Using  property  (5.4),  we  have 


-  Jo 


K:H2  d(M,M) 


roc 

=  E  / 

J  L  Jo 


K2  d(H  ■  M,H  ■  M) 


which  gives  the  first  assertion.  For  the  second  one,  we  write  for  N  e  H2, 

{(KH)  ■  M,N)  =  KH  ■  (M,  N)  —  K  ■  (H  ■  (M,N))  =  K-(H-M,N) 


and,  by  the  uniqueness  statement  in  (5.2),  this  implies  that  {KH)  •  M  —  K  •  {H  •  M) 


□ 


Moments  of  stochastic  integrals.  Let  M  g  H2,  N  e  H2,  H  £  L2{M)  and  K  £ 
L2{N).  Since  H  •  M  and  K  •  N  are  martingales  in  H2,  we  have,  for  every  t  £  [0,  oo], 


[/ 


E  Hs  d  Ms 


=  0 


HsKsd(M,N). 


using  Proposition  4.15  (v)  and  (5.5)  to  derive  (5.7).  In  particular, 


(5.6) 


(5.7) 


HsdMs 


o 


)2M  L 


H1  d(M,M) 


(5.8) 
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Furthermore,  since  H  •  M  is  a  (true)  martingale,  we  also  have  for  every  0  <  s  <  t  < 
oo, 


/ 


Hr  d Mr 


l 


Hr  m 


ri 


or  equivalently 


(5.9) 


/ 


HrdMr 


with  an  obvious  notation  for  f*  Hr  d Mr.  It  is  important  to  observe  that  these  formulas 
(and  particularly  (5.6)  and  (5.8))  may  no  longer  hold  for  the  extensions  of  stochastic 
integrals  that  we  will  now  describe. 


5.1.2  Stochastic  Integrals  for  Local  Martingales 


We  will  now  use  the  identities  (5.3)  to  extend  the  definition  of  H  •  M  to  an  arbitrary 
continuous  local  martingale.  If  M  is  a  continuous  local  martingale,  we  write  L2oc(M) 
(resp.  L2(M))  for  the  set  of  all  progressive  processes  H  such  that 


<  oo ,  Vf  >  0,  a.s. 


(resp.  such  that 


H]  d (M,  M)s 


<  oo). 


For  future  reference,  we  note  that  L2(M)  (with  the  same  identifications  as  in  the 
case  where  M  eW2)  can  again  be  viewed  as  an  “ordinary”  L2- space  and  thus  has  a 
Hilbert  space  structure. 

Theorem  5.6  Let  M  be  a  continuous  local  martingale.  For  every  H  e  L20C(M), 
there  exists  a  unique  continuous  local  martingale  with  initial  value  0,  which  is 
denoted  by  H  •  M,  such  that,  for  every  continuous  local  martingale  N, 


(H  •  M,  N)  =H-{M,N).  (5.10) 

IfT  is  a  stopping  time,  we  have 

(1[0, T]H)  •  M  =  (H  ■  M)T  =  H  ■  Mt.  (5.11) 

IfH  e  L[oc(M)  and  K  is  a  progressive  process,  we  have  K  e  tfoc(H  •  M)  if  and  only 
ifHK  e  Lj20C(M),  and  then 


H  ■  (K  ■  M)  —  HK  ■  M. 


(5.12) 


5.1  The  Construction  of  Stochastic  Integrals 


107 


Finally,  ifM  G  H2,  and  H  G  L2  (M),  the  definition  ofH  •  M  is  consistent  with  that  of 
Theorem  5.4. 

Proof  We  may  assume  that  Mq  =  0  (in  the  general  case,  we  write  M  —  Mo  +  M' 
and  we  just  set  H  •  M  —  H  •  M' ,  noting  that  ( M ,  N)  —  (M'  ,N)  for  every  continuous 
local  martingale  N).  Also  we  may  assume  that  the  property  //2  d  (M,M)S  <  oo 
for  every  t  >  0  holds  for  every  co  G  Q  (on  the  negligible  set  where  this  fails  we  may 
replace  H  by  0). 

For  every  n  >  1 ,  set 


Tn  =  inf {*  >  0  :  f  (1  +  //s2)  d(M,  M).s  >  n}, 

Jo 

so  that  (rn)  is  a  sequence  of  stopping  times  that  increase  to  +oo.  Since 

(. Mt\Mt"),  =  (M,M),AT„  <  n, 

the  stopped  martingale  Mr"  is  in  H2  (Theorem  4.13).  Furthermore,  we  also  have 

r  oo  i*  Tn 

/  H2s  d (MTn ,  Mt" ) s  =  /  H2d(M,M)s<n. 

Jo  Jo 

Hence,  //  g  Lr(MTn ),  and  the  definition  of  //  •  MTn  makes  sense  by  Theorem  5.4. 
Moreover,  by  property  (5.3),  we  have,  if  m  >  n, 

H  •  MTn  =  (//  •  MTm)Tn . 

It  follows  that  there  exists  a  unique  process  denoted  by  H  •  M  such  that,  for  every  n, 

C H-M)Tn  =H-MTn. 

Clearly  H  •  M  has  continuous  sample  paths  and  is  also  adapted  since  (H  •  M)t  = 
lim(//  •  MTn)t.  Since  the  processes  ( H  •  M)Tn  are  martingales  in  H2,  we  get  that  H  •  M 
is  a  continuous  local  martingale. 

Then,  to  verify  (5.10),  we  may  assume  that  A  is  a  continuous  local  martingale 
such  that  No  =  0.  For  every  n  >  1,  set  T'n  =  inf {t  >  0  :  \Nt\  >  n },  and  Sn  =  Tn  A  T'n. 
Then,  noting  that  NTn  G  M2,  we  have 

(. H  •  AT,  N)Sn  =  ((FI-  M)Tn ,  Nt * ) 

=  (H  •  MTn ,  NT'n ) 

=  H  •  {MTn ,NTn) 

=  H  •  {M,N)Sn 
=  C H-{M,N))Sn , 
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which  gives  the  equality  (H-M,N)  =  H-  ( M ,  N).  since  Sn  \  oo  as  n  ->  oo.  The  fact 
that  this  equality  (written  for  every  continuous  local  martingale  N)  characterizes 
H  •  M  among  continuous  local  martingales  with  initial  value  0  is  derived  from 
Proposition  4.12  as  in  the  proof  of  Theorem  5.4. 

The  property  (5. 1 1)  is  then  obtained  by  the  very  same  arguments  as  in  the  proof  of 
property  (5.3)  in  Theorem  5.4  (these  arguments  only  depended  on  the  characteristic 
property  (5.2)  which  we  have  just  extended  in  (5.10)).  Similarly,  the  proof  of  (5.12) 
is  analogous  to  the  proof  of  Proposition  5.5. 

Finally,  if  M  e  HI2  and  H  e  L2(M),  the  equality  (H  M,H  M)  =  H2  •  (Af,  M) 
follows  from  (5.10),  and  implies  that  H  •  M  e  H2.  Then  the  characteristic 
property  (5.2)  shows  that  the  definitions  of  Theorems  5.4  and  5.6  are  consistent.  □ 

In  the  setting  of  Theorem  5.6,  we  will  again  write 

(H  •  M)t  =  [  HsdMs. 

Jo 

It  is  worth  pointing  out  that  formulas  (5.4)  and  (5.5)  remain  valid  when  M  and 
N  are  continuous  local  martingales  and  H  e  L20C(M),  K  e  L20C  (N) .  Indeed,  these 
formulas  immediately  follow  from  (5.10). 

Connection  with  the  Wiener  integral  Suppose  that  B  is  an  (J^)- -Brownian 
motion,  and  h  e  L2(M+,  ^(M+),  d^)  is  a  deterministic  square  integrable  function. 
We  can  then  define  the  Wiener  integral  h(s)dBs  —  G(f  l[o,r])5  where  G  is  the 
Gaussian  white  noise  associated  with  B  (see  the  end  of  Sect.  2.1).  It  is  easy  to  verify 
that  this  integral  coincides  with  the  stochastic  integral  ( h  •  B)t ,  which  makes  sense 
by  viewing  h  as  a  (deterministic)  progressive  process.  This  is  immediate  when  h  is 
a  simple  function,  and  the  general  case  follows  from  a  density  argument. 

Let  us  now  discuss  the  extension  of  the  moment  formulas  that  we  stated  above 
in  the  setting  of  Theorem  5.4.  Let  M  be  a  continuous  local  martingale,  H  e  L2oc(M) 
and  t  e  [0,  oo].  Then,  under  the  condition 


(5.13) 


we  can  apply  Theorem  4. 13  to  (H  •  M)\  and  get  that  ( H  •  M)r  is  a  martingale  of  H2. 
It  follows  that  properties  (5.6)  and  (5.8)  still  hold: 


/ 


H,  dM, 


H,  AM 


n 


and  similarly  (5.9)  is  valid  for  0  <  s  <  t.  In  particular  (case  t  —  oo),  if  H  e  L2(M), 
the  continuous  local  martingale  H  •  M  is  in  H2  and  its  terminal  value  satisfies 


E 


oo 


HAM, 


)1 


H;d(M,M)s 
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If  the  condition  (5.13)  does  not  hold,  the  previous  formulas  may  fail.  However,  we 
always  have  the  bound 


E 


H,  d Af, 


f] 


(5.14) 


Indeed,  if  the  right-hand  side  is  finite,  this  is  an  equality  by  the  preceding 
observations.  If  the  right-hand  side  is  infinite,  the  bound  is  trivial. 


5.1.3  Stochastic  Integrals  for  Semimartingales 


We  finally  extend  the  definition  of  stochastic  integrals  to  continuous  semimartin¬ 
gales.  We  say  that  a  progressive  process  H  is  locally  bounded  if 


Vt  >  0, 


sup  |  Hs 

s<t 


<  00, 


a.s. 


In  particular,  any  adapted  process  with  continuous  sample  paths  is  a  locally  bounded 
progressive  process.  If  H  is  (progressive  and)  locally  bounded,  then  for  every  finite 
variation  process  V,  we  have 


Vt  >  0, 


Hs\  |dV*|  <  oo, 


a.s. 


and  similarly  H  e  L^oc  (M)  for  every  continuous  local  martingale  M. 

Definition  5.7  Let  X  be  a  continuous  semimartingale  and  let  X  =  M  +  V 
be  its  canonical  decomposition.  If  H  is  a  locally  bounded  progressive  process, 
the  stochastic  integral  H  •  X  is  the  continuous  semimartingale  with  canonical 
decomposition 


H  -X  =  H  •  M  +  H  -V, 


and  we  write 

(H-X)t=  [  HsdXs. 
Jo 


Properties 

(i)  The  mapping  (H,  X)  i-^  H  •  X  is  bilinear. 

(ii)  H  •  (K  -X)  =  ( HK )  •  X,  if  H  and  K  are  progressive  and  locally  bounded. 

(iii)  For  every  stopping  time  T,  (H  •  X)T  —  H1\qj\  •  X  =  H  •  XT . 

(iv)  If  X  is  a  continuous  local  martingale,  resp.  if  X  is  a  finite  variation  process, 
then  the  same  holds  for  H  •  X. 
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(v)  If  H  is  of  the  form  Hs ( co )  =  J2‘i=o  !(</,<,■ +1](s)>  where  0  =  r0  <  t\  < 

and,  for  every  i  e  {0, 1, . . .  ,p  —  1},  is  -measurable,  then 


p- i 

(H-X)t  =  YJHii){Xti+lM-XtiM). 

i= 0 

We  can  restate  the  “associativity”  property  (ii)  by  saying  that,  if  Yt  —  KsdXs 
then 

[  Hs  dYs  =  [  HsKsdXs. 

Jo  Jo 

Properties  (i)-(iv)  easily  follow  from  the  results  obtained  when  X  is  a  continuous 
local  martingale,  resp.  a  finite  variation  process.  As  for  property  (v),  we  first  note 
that  it  is  enough  to  consider  the  case  where  X  —  M  is  a  continuous  local  martingale 
with  Mo  =  0,  and  by  stopping  M  at  suitable  stopping  times  (and  using  (5.11)), 
we  can  even  assume  that  M  is  in  HI2.  There  is  a  minor  difficulty  coming  from  the 
fact  that  the  variables  are  not  assumed  to  be  bounded  (and  therefore  we  cannot 
directly  use  the  construction  of  the  integral  of  elementary  processes).  To  circumvent 
this  difficulty,  we  set,  for  every  n  >  1 , 


Tn  =  inf{£  >  0  :  \Ht\  >  n)  =  inf {tt  :  \H(i)\  >  n} 


(where  inf0  =  oo). 


It  is  easy  to  verify  that  Tn  is  a  stopping  time,  and  we  have  Tn  \  oo  as  n  oo. 
Furthermore,  we  have  for  every  n , 


p-  i 

i= 0 

where  the  random  variables  l{r„>o}  satisfy  the  same  properties  as  the 

s  and  additionally  are  bounded  by  n.  Hence  H  l[o,rn]  is  an  elementary  process, 
and  by  the  very  definition  of  the  stochastic  integral  with  respect  to  a  martingale  of 
M2,  we  have 


p-  i 


(H  ■  M)tATu  =  (H  l[0,Tn]  -M),  =  (Mli+l 


At 


i= 0 


The  desired  result  now  follows  by  letting  n  tend  to  infinity. 
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5.1.4  Convergence  of  Stochastic  Integrals 


We  start  by  giving  a  ‘‘dominated  convergence  theorem”  for  stochastic  integrals. 

Proposition  5.8  Let  X  —  M  +  V  be  the  canonical  decomposition  of  a  continuous 
semimartingale  X,  and  let  t  >  0.  Let  (Hn)n>\  and  H  be  locally  bounded  progressive 
processes ,  and  let  K  be  a  nonnegative  progressive  process.  Assume  that  the 
following  properties  hold  a.s.: 

(i)  Hns  — >  Hs  as  n  — >►  oo,  for  every  s  G  [0,  t\; 

(ii)  | Hns  |  <  Ks ,  for  every  n  >  1  and  s  G  [0,  t\; 

(iii)  f(Ks)2  d(M,  M)s  <  oo  and  fQKs  |dV,|  <  oo. 

Then, 


Hns  dXs 


HsdXs 


in  probability. 

Remarks 


(a)  Assertion  (iii)  holds  automatically  if  K  is  locally  bounded. 

(b)  Instead  of  assuming  that  (i)  and  (ii)  hold  for  every  s  G  [0,  t\  (a.s.),  it  is  enough  to 
assume  that  these  conditions  hold  for  d (M,  M)s- a.e.  s  g  [0,  t\  and  for  |d\4|-a.e. 
s  G  [0,  t],  a.s.  This  will  be  clear  from  the  proof. 

Proof  The  a.s.  convergence 


HsdVs 


follows  from  the  usual  dominated  convergence  theorem.  So  we  just  have  to  verify 
that  Jq  H"  dMs  converges  in  probability  to  f{}  Hs  dMs.  For  every  integer  p  >  1, 
consider  the  stopping  time 


Tp  inf{r  e  [0,  t\  :  [  ( Ksfd{M ,  M)s  >  p\  A  t, 

Jo 

and  observe  that  Tp  —  t  for  all  large  enough  p ,  a.s.,  by  assumption  (iii).  Then,  the 
bound  (5.14)  gives 


E 


H'l  d Ms  - 


Hs)2d(M,M)s 


which  tends  to  0  as  n  ->  oo,  by  dominated  convergence,  using  assumptions  (i)  and 

(ii)  and  the  fact  that  JqP ( Ks )2  d(M, M)s  <  p.  Since  P(Tp  —  f)  tends  to  1  as  p  ^  oo, 
the  desired  result  follows.  □ 
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We  apply  the  preceding  proposition  to  an  approximation  result  in  the  case  of 
continuous  integrands,  which  will  be  useful  in  the  next  section. 

Proposition  5.9  Let  X  be  a  continuous  semimartingale,  and  let  H  be  an  adapted 
process  with  continuous  sample  paths.  Then,  for  every  t  >  0,  for  every  sequence 
0  =  /q  <  •  •  •  <  tnpn  —  t  of  subdivisions  of[ 0,  t\  whose  mesh  tends  to  0,  we  have 

Pn~  i 

lim  ,  -Xfi)  = 

>00  *  J  1  l'[  1 
i= 0 

in  probability. 

Proof  For  every  n  >  1 ,  define  a  process  Hn  by 

(  Htn  if  f!  <  s  <  fi+v  for  every  i  e  {0, 1,  ...,pn  -  1} 

Hns  =  \  Ho  if  j  =  0 
[  0  if  s  >  t. 

Note  that  Hn  is  progressive.  We  then  observe  that  all  assumptions  of  Proposition  5.8 
hold  if  we  take 


Ks  —  max  |  H, 

0<r<5 


which  is  a  locally  bounded  progressive  process.  Hence,  we  conclude  that 


HnsdXs 


HsdXs 


in  probability.  This  gives  the  desired  result  since,  by  property  (v)  in  Sect.  5.1.3,  we 
have 


Hns  dXs  = 


Pn~  1 


i= 0 


□ 

Remark  The  preceding  proposition  can  be  viewed  as  a  generalization  of 
Lemma  4.3  to  stochastic  integrals.  However,  in  contrast  with  that  lemma,  it  is 
essential  in  Proposition  5.9  to  evaluate  H  at  the  left  end  of  the  interval  (fl,  ^+1]: 
The  result  will  fail  if  we  replace  Hq  by  Hf}  .  Let  us  give  a  simple  counterexample. 
We  take  Ht  —  Xt  and  we  assume  that  the  sequence  of  subdivisions  (t^)o<i<pn  is 
increasing.  By  the  proposition,  we  have 


Pn~  1 

lim  )  Xtn  (Xtn 

n->oo  1  l' 


+  1 


i=  0 


XsdXs, 
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in  probability.  On  the  other  hand,  writing 


Pn~  1  Pn~  1 

E^+.^r+i  -*?)  =  EW1- 

i=0  i=0 

and  using  Proposition  4.21,  we  get 


Pn~  1 

^)  +  E^'+1 


i=0 


Pn  1  />  t 

lim  Vx,»  ,(X,»  -X*)  =  /  X,dX,+  <X,X)„ 

w->00  '"T1  ‘"I"1  J  /n 

i=0  JU 

in  probability.  The  resulting  limit  is  different  from  X5dXy  unless  the  martingale 
part  of  X  is  degenerate.  Note  that,  if  we  add  the  previous  two  convergences,  we 
arrive  at  the  formula 


f 


X,dX,  + 


(X.X), 


which  is  a  special  case  of  Ito’s  formula  of  the  next  section. 


5.2  Ito’s  Formula 

Ito’s  formula  is  the  cornerstone  of  stochastic  calculus.  It  shows  that,  if  we  apply  a 
twice  continuously  differentiable  function  to  a  p-tuple  of  continuous  semimartin¬ 
gales,  the  resulting  process  is  still  a  continuous  semimartingale,  and  there  is  an 
explicit  formula  for  the  canonical  decomposition  of  this  semimartingale. 

Theorem  5.10  (Ito’s  formula)  Let  X1, ...  ,XP  be  p  continuous  semimartingales, 
and  let  F  be  a  twice  continuously  differentiable  real  function  on  Then,  for  every 
t  >  0, 


p 


i=  1 

i  P  pt  ff-p 

+2^1  My(x! . 

ij=  1 

Proof  We  first  deal  with  the  case  p  —  1  and  we  write  X  —  X1  for  simplicity.  Fix 
t  >  0  and  consider  an  increasing  sequence  0  =  ^  <  •  •  •  <  f  —  t  of  subdivisions 
of  [0,  t\  whose  mesh  tends  to  0.  Then,  for  every  n , 


F{Xln...X)=F{Xl...,X^)  +  YJ 


Pn~  1 

F(Xt)  =  F(X 0)  +  X(F(X‘"P  ~  F{X 

1=0 
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For  every  i  e  {0, 1,  —  1},  we  apply  the  Taylor-Lagrange  formula  to  the 

function  [0, 1]  9  0  i->  F(Xt*  +  0(X^  —  Xt»)),  between  0—0  and  0  =  1,  and  we 
get  that 


F(X$  )  -  FiX,*)  =  F' iX(;){X,*+i  -Xfj)  + 


where  the  quantity  fnj  can  be  written  as  F" (Xt«  +  c{Xf .  —Xt»))  for  some  c  e  [0,1]. 
By  Proposition  5.9  with  Hs  =  F'(XS),  we  have 


Pn  1  /»f 

lim  YF'(Xtn)(Xfr  -Xf;)=  /  F'(Xs)dXs, 

n-^o o  1  1~f1  1 

i=0  0 

in  probability.  To  complete  the  proof  of  the  case  p  =  1  of  the  theorem,  it  is  therefore 
enough  to  verify  that 


n 


Pn~ 1 

lim 

— FOO 

i= 0 


i+1 


F"(X5)d(X,X)5, 


in  probability.  We  observe  that 


(5.15) 


sup  \fn,i  -  F" (X,n)  |<  sup 


sup 


F"(x) 


o  <i<pn—l 


0< /'</?/, —  1  \  X^[Xji  AXji  ,X,n  VX.n  | 

i  ;+ 1  i  ;+ 1 


The  right-hand  side  of  the  preceding  display  tends  to  0  a. s.  as  n  ->  oo,  as  a  simple 
consequence  of  the  uniform  continuity  of  F "  (and  of  the  sample  paths  of  X)  over  a 
compact  interval. 

Since  (Xtn  —  Xp)2  converges  in  probability  (Proposition  4.21),  it  follows 

‘  J  l  'J  i~\~  1  t 

from  the  last  display  that 


Pn~  1  Pn~  1 

«■(*?+,  -^r)2- 

i=0  z=0 


■>  0 


n—Foo 


in  probability.  So  the  convergence  (5.15)  will  follow  if  we  can  verify  that 


Pn  1  />  f 

lim  y^F”iXfl)iX(;  -X A2  =  /  F"(XS)  d{X,X)s,  (5.16) 

i= 0 

in  probability.  In  fact,  we  will  show  that  (5.16)  holds  a.s.  along  a  suitable  sequence 
of  values  of  n  (this  suffices  for  our  needs,  because  we  can  replace  the  initial  sequence 
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of  subdivisions  by  a  subsequence).  To  this  end,  we  note  that 


Pn~  1  » 

E  ^"(%)(^+1  -  -  /  F"(X,)  \Ln 

i= o  ‘'M 


(<k), 


where  /zn  is  the  random  measure  on  [0,  t]  defined  by 


Pn~  1 

M«(dr)  :=  y](X(«+|  -X,»)2fy(dr). 

i=0 

Write  D  for  the  dense  subset  of  [0,  t]  that  consists  of  all  fj  for  n>  1  and  0  <  i  <  pn. 
As  a  consequence  of  Proposition  4.21,  we  get  for  every  r  e  D, 

M„([0,r])  — ►  (X,  X)r 

w— >oo 

in  probability.  Using  a  diagonal  extraction,  we  can  thus  find  a  subsequence  of  values 
of  n  such  that,  along  this  subsequence,  we  have  for  every  r  e  D, 

/z„([0,r])  (X,X) 

«->oo  r 


which  implies  that  the  sequence  \in  converges  a.s.  to  the  measure  l[o,/](/)  d(X,X)r, 
in  the  sense  of  weak  convergence  of  finite  measures.  We  conclude  that  we  have 


a.s. 

- > 

«->oo 


F"(XS)  d(X,X)s 


along  the  chosen  subsequence.  This  completes  the  proof  of  the  case  p  —  1 . 

In  the  general  case,  the  Taylor-Lagrange  formula,  applied  for  every  n  >  1  and 
every  i  e  {0, 1,  —  1}  to  the  function 


[0, F(Xi  +  0(Xi  -  Xi), . . . ,  X?  +  0(Xp 

i  ;+ 1  i  /' 


+  1 


gives 


F(Xl 

i+\ 


,XP,  )-F(X},,. 

1  i 


,Xpt) 


P  \  _ 


P 

. -*i?) 


£=1 


i+l 


+  E 

fc,/=l 
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where,  for  every  k,l  e  { 1, . . .  ,p }, 


-fk,l  _ 

J  n,i 


d2F 
dxi  (Lx, 


(Xe+c(X«  -Xg)), 


'  + 1 


for  some  c  G  [0,1]  (here  we  use  the  notation  Xt  —  (Xj , . . . ,  Xpt )). 

Proposition  5.9  can  again  be  used  to  handle  the  terms  involving  first  derivatives. 
Moreover,  a  slight  modification  of  the  arguments  of  the  case  p  —  1  shows  that,  at 
least  along  a  suitable  sequence  of  values  of  n ,  we  have  for  every  k,l  e  {1 


Pn  1  nt  a2i7 

iim  V Zft’l&k  -  4x4+  -  4)  =  /  (V1  -  •  •  •  -  V)  d(V, xl) 

1+1  '  1+1  '  JO  dxkdxi 


in  probability.  This  completes  the  proof  of  the  theorem.  □ 

An  important  special  case  of  Ito’s  formula  is  the  formula  of  integration  by 
parts,  which  is  obtained  by  taking  p  —  2  and  F(x,  y)  =  xy:  if  X  and  Y  are  two 
continuous  semimartingales,  we  have 

f  nt 

Xs  dFy  +  /  YsdXs  +  (X,Y)t. 

Jo 


XtYt  =  X0Y0  + 


L 


In  particular,  if  Y  —  X, 


xj  =  X2  +  2  [  Xs  dXs  +  {X,  X)t. 

JO 

When  X  —  M  is  a  continuous  local  martingale,  we  know  from  the  definition  of  the 
quadratic  variation  that  M1  —  (M,  M }  is  a  continuous  local  martingale.  The  previous 
formula  shows  that  this  continuous  local  martingale  is 

M2  +  2  [  MsdMs. 

Jo 

We  could  have  seen  this  directly  from  the  construction  of  (M,  M)  in  Chap.  4  (this 
construction  involved  approximations  of  the  stochastic  integral  /0rMydMy). 

Let  B  be  an  (J^)-real  Brownian  motion  (recall  from  Definition  3.11  that  this 
means  that  B  is  a  Brownian  motion,  which  is  adapted  to  the  filtration  (JFt)  and  such 
that,  for  every  0  <  s  <  t,  the  variable  Bt  —  Bs  is  independent  of  the  a -field  &s).  An 
(J^)- Brownian  motion  is  a  continuous  local  martingale  (a  martingale  if  Bo  e  L[) 
and  we  already  noticed  that  its  quadratic  variation  is  (B,  B)t  —  t. 

In  this  particular  case,  Ito’s  formula  reads 

F(Bt )  =  F(B0)  +  [  F\Bs)dBs  +  \  [  F"(BS) ds. 

Jo  J  Jo 
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Taking  Xj  =  t,  X2  —  Bt,  we  also  get  for  every  twice  continuously  differentiable 
function  F(t ,  v)  on  M+  x  R, 


G  dF  G  dF  1  d2F 

F(t, Bt)  =  F(0,Bo)  +  J  —(s,Bs)dBs-\-  J  Bs)  ds. 

Let  Bt  =  (B) , . . . ,  Bd)  be  a  d-dimensional  (^) -Brownian  motion.  Note  that  the 
components Bl, ...  ,Bd  are  (J^) -Brownian motions.  By  Proposition 4. 16,  (Bl, BJ)  — 
0  when  i  ^  j  (by  subtracting  the  initial  value,  which  does  not  change  the  bracket 
(. Bl ,  B-i) ,  we  are  reduced  to  the  case  where  Bl, ...  ,Bd  are  independent).  Ito’s  formula 
then  shows  that,  for  every  twice  continuously  differentiable  function  F  on 


=  fK 


X)  + 


d  n\ 

u 


3  F 
3  xi 


1 

-J  af(b\ , 


B' )  dv. 


The  latter  formula  is  often  written  in  the  shorter  form 

F(Bt)  =  F(B0)  +  f  V F (Bs)  •  d Bs  +  \  [  AF(BS)  ds, 

Jo  z  Jo 

where  VF  stands  for  the  vector  of  first  partial  derivatives  of  F.  There  is  again  an 
analogous  formula  for  F(t,  Bt). 

Important  remark  It  frequently  occurs  that  one  needs  to  apply  Ito’s  formula  to 
a  function  F  which  is  only  defined  (and  twice  continuously  differentiable)  on  an 
open  subset  U  of  W.  In  that  case,  we  can  argue  in  the  following  way.  Suppose 
that  there  exists  another  open  set  V,  such  that  (Xq,  . . .  ,X%)  e  V  a.s.  and  V  c  U 
(here  V  denotes  the  closure  of  V).  Typically  V  will  be  the  set  of  all  points  whose 
distance  from  Uc  is  strictly  greater  than  s,  for  some  e  >  0.  Set  Ty  :=  inf{t  > 
0  :  (Xj , . . . ,  Xpt  )  V},  which  is  a  stopping  time  by  Proposition  3.9.  Simple 

analytic  arguments  allow  us  to  find  a  function  G  which  is  twice  continuously 
differentiable  on  W  and  coincides  with  F  on  V.  We  can  now  apply  Ito’s  formula  to 
obtain  the  canonical  decomposition  of  the  semimartingale  G(XjATy, . . . ,  XptATy)  — 
F(XjATy, . . .  ,XptATy),  and  this  decomposition  only  involves  the  first  and  second 
derivatives  of  F  on  V.  If  in  addition  we  know  that  the  process  (X) , . . . ,  Xpt )  a.s.  does 
not  exit  U,  we  can  let  the  open  set  V  increase  to  U,  and  we  get  that  Ito’s  formula  for 
F(X] , . . . ,  Xpt  )  remains  valid  exactly  in  the  same  form  as  in  Theorem  5.10.  These 
considerations  can  be  applied,  for  instance,  to  the  function  F(x)  =  logv  and  to  a 
semimartingale  X  taking  strictly  positive  values:  see  the  proof  of  Proposition  5.21 
below. 
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We  now  use  Ito’s  formula  to  exhibit  a  remarkable  class  of  (local)  martingales, 
which  extends  the  exponential  martingales  associated  with  processes  with  indepen¬ 
dent  increments.  A  random  process  with  values  in  the  complex  plane  C  is  called  a 
complex  continuous  local  martingale  if  both  its  real  part  and  its  imaginary  part  are 
continuous  local  martingales. 

Proposition  5.11  Let  M  be  a  continuous  local  martingale  and,  for  every  A  e  C,  let 

/  X2  \ 

S{XM)t  =  exp  [XMt  -  —{M,Af)ty 

The  process  S  (AM)  is  a  complex  continuous  local  martingale,  which  can  be  written 
in  the  form 


£  (AM),  =  eXM°  +  A 


l 


S  (AM) , 


Remark  The  stochastic  integral  in  the  right-hand  side  of  the  last  display  is  defined 
by  dealing  separately  with  the  real  and  the  imaginary  part. 

Proof  If  F(r,  v)  is  a  twice  continuously  differentiable  function  on  M2,  Ito’s  formula 
gives 


C  dF 

F((M,  M)t,Mt)  =  F(0,M0)  +  /  —  ((M,M)S,MS)  dMs 

Jo  vx 

/dF  1  32F\  #  x  N  # 

+  J  (—  +  -~^)((M,M)s,Ms)d(M,M)s. 

Hence,  F((M,M)t,Mt )  is  a  continuous  local  martingale  as  soon  as  F  satisfies  the 
equation 


dF  1  d2F  _ 
dr  +  2  dx2 

This  equation  holds  for  F(r,x )  =  exp(Av  -  yr)  (more  precisely  for  both  the  real 
and  the  imaginary  part  of  this  function).  Moreover,  for  this  choice  of  F  we  have 
ff  —  XF,  which  leads  to  the  formula  of  the  statement.  □ 


5.3  A  Few  Consequences  of  Ito’s  Formula 

Ito’s  formula  has  a  huge  number  of  applications.  In  this  section,  we  derive  some  of 
the  most  important  ones. 
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5.3.1  Levy’s  Characterization  of  Brownian  Motion 

We  start  with  a  striking  characterization  of  real  Brownian  motion  as  the  unique 
continuous  local  martingale  M  such  that  ( M,M)t  =  t.  In  fact,  we  give  a 
multidimensional  version  of  this  result,  which  is  known  as  Levy’s  theorem. 

Theorem  5.12  Let  X  —  ( Xl,...,Xd )  be  an  adapted  process  with  continuous 
sample  paths.  The  following  are  equivalent: 

(i)  X  is  a  d-dimensional  (J^)- -Brownian  motion. 

(ii)  The  processes  X1, ...  ,Xd  are  continuous  local  martingales ,  and  (Xl,X^)t  —  8y  t 
for  every  ij  £  {1 , ...  ,d)  (here  8y  is  the  Kronecker  symbol,  8y  —  1  {/=/})• 

In  particular,  a  continuous  local  martingale  M  is  an  (^t) -Brownian  motion  if  and 
only  if  ( M,M)t  —  t,  for  every  t  >  0,  or  equivalently  if  and  only  if  Mj  —  t  is  a 
continuous  local  martingale. 

Proof  The  fact  that  (i)  =>>  (ii)  has  already  been  derived.  Let  us  assume  that  (ii) 
holds.  Let  £  =  (£i, . . . ,  £j)  C  Rd.  Then,  £  •  Xt  =  Ylj=  l  is  a  continuous  local 

martingale  with  quadratic  variation 

d  d 

j=  1  k=  1 

By  Proposition  5.1 1,  exp(i£  •  Xt  +  ^  |£| 2t)  is  a  complex  continuous  local  martingale. 
This  complex  continuous  local  martingale  is  bounded  on  every  interval  [0,  a],  a  >  0, 
and  is  therefore  a  (true)  martingale,  in  the  sense  that  its  real  and  imaginary  parts  are 
both  martingales.  Hence,  for  every  0  <  s  <  t, 

E[exp(i£  -X,  +  l|£|2f)  I  &A  =  exp(i£  •  Xs  + 


and  thus 


£[exp(i^  •  ( X t  -  Xs )  |  &s]  =  exp(— l|^|2(f  -  s)). 
It  follows  that,  for  every  A  e 


E[1a  exp(i^  •  (X,  -  X,))]  =  P(A)  exp(— l|^|2(f  -  s)). 

Taking  A  =  T2,  we  get  that  Xt  —  Xs  is  a  centered  Gaussian  vector  with  covariance 
matrix  ( t  —  y)Id  (in  particular,  the  components  XJt  —  Xf  1  <j<d  are  independent). 
Furthermore,  fix  A  e  with  P(A)  >  0,  and  write  Pa  for  the  conditional  probability 
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measure  Pa()  —  P(A)  lP(-  D  A).  We  also  obtain  that 

PA[exp(i£  •  (X,  -  Xs))]  =  exp(-l|^|2(r  -  s)) 

which  means  that  the  law  of  Xt  —  Xs  under  Pa  is  the  same  as  its  law  under  P. 
Therefore,  for  any  nonnegative  measurable  function/  on  M/  we  have 


PA[f(Xt-Xs)]=E\f(Xt-Xs)], 


or  equivalently 


E[lAf(Xt  -  Xs )]  =  P(A)  E\f(Xt  -  Xs)]. 

This  holds  for  any  A  e  (when  P(A)  =  0  the  equality  is  trivial),  and  thus  Xt  —  Xs 
is  independent  of  &s. 

It  follows  that,  if  to  —  0  <  h  tp  y  the  vectors  Xtl  —  Xt0,Xt2  — 

Xtl, ... ,  Xtp  —  Xtp_l  are  independent.  Since  the  components  of  each  of  these  vectors 

are  independent  random  variables,  we  obtain  that  all  variables  XJtk  —  XJtk_  -pi  < 

j  <  d,  1  <  k  <  p  are  independent,  and  XJtjc  —  XJtk_x  is  distributed  according  to 
yK(0,  tk—tk-i ).  This  implies  thatX— Xo  is  a  d-dimensional  Brownian  motion  started 
from  0.  Since  we  also  know  that  X— Xo  is  independent  of  Xo  (as  an  easy  consequence 
of  the  fact  that  Xt  —  Xs  is  independent  of  for  every  0  <  s  <  t),  we  get  that  X 
is  a  d-dimensional  Brownian  motion.  Finally,  X  is  adapted  and  has  independent 
increments  with  respect  to  the  filtration  (J^)  so  that  X  is  a  d-dimensional  (J^)- 
Brownian  motion.  □ 


5.3.2  Continuous  Martingales  as  Time-Changed  Brownian 
Motions 

The  next  theorem  shows  that  any  continuous  local  martingale  M  can  be  written  as  a 
“time-changed”  Brownian  motion  (in  fact,  we  prove  this  only  when  (M,  M)^  =  oo, 
but  see  the  remarks  below).  It  follows  that  the  sample  paths  of  M  are  Brownian 
sample  paths  run  at  a  different  (varying)  speed,  and  certain  almost  sure  properties  of 
sample  paths  of  M  can  be  deduced  from  the  corresponding  properties  of  Brownian 
sample  paths.  For  instance,  under  the  condition  (M,M) ^  =  oo,  the  sample  paths 
of  M  must  oscillate  between  +oo  and  -oo  as  /  ->  oo  (cf.  the  last  assertion  of 
Proposition  2.14). 
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Theorem  5.13  (Dambis-Dubins-Schwarz)  Let  M  be  a  continuous  local  martin¬ 
gale  such  that  (M,  M}^  —  oo  a.s.  There  exists  a  Brownian  motion  (fis)s> o  such 
that 

a.s.  Vr  >  0,  M,  =  P<mm>i. 


Remarks 

(i)  One  can  remove  the  assumption  (M,  M)^  =  oo,  at  the  cost  of  enlarging  the 
underlying  probability  space,  see  [70,  Chapter  V]. 

(ii)  The  Brownian  motion  ft  is  not  adapted  with  respect  to  the  filtration  (J^),  but 
with  respect  to  a  “time-changed”  filtration,  as  the  following  proof  will  show. 

Proof  We  first  assume  that  M0  =  0.  For  every  r  >  0,  we  set 

xr  =  inf{7  >  0  :  ( M,M)t  >  r}. 

Note  that  rr  is  a  stopping  time  by  Proposition  3.9.  Furthermore,  we  have  xr  <  oo 
for  every  r  >  0,  on  the  event  {{M,  M)^  —  oo}.  It  will  be  convenient  to  redefine  the 
variables  xr  on  the  (negligible)  event  ^  <  oo}  by  taking  r r(co)  —  0 

for  every  r  >  0  if  co  e  .  Since  the  filtration  is  complete,  xr  remains  a  stopping 
time  after  this  modification. 

By  construction,  for  every  co  e  T2,  the  function  r  i->  xr(co)  is  nondecreasing  and 
left-continuous,  and  therefore  has  a  right  limit  at  every  r  >  0.  This  right  limit  is 
denoted  by  rr+  and  we  have 

Tr+  =  inf{t  >  0  :  ( M,M)t  >  r}, 

except  of  course  on  the  negligible  set  JT ,  where  rr+  =  0. 

We  set  pr  =  MZr  for  every  r  >  0.  By  Theorem  3.7,  the  process  (/3r)r> o  is  adapted 
with  respect  to  the  filtration  (&r)  defined  by  for  every  r  >  0,  and  = 

^oo.  Note  that  the  filtration  (£fr)  is  complete  since  this  property  holds  for  (^t). 

The  sample  paths  r  i->-  Pr(co)  are  left-continuous  and  have  right  limits  given  for 
every  r  >  0  by 


Pr+  =  lim  =  MZr+ . 

In  fact  we  have  for  every  r  >  0,  a.s.,  as  a  consequence  of  the  following 

lemma. 

Lemma  5.14  We  have  a.s.  for  every  0  <  a  <  b, 

Mt  =  Ma,  Vt  e  [a,b\  (M,  M)b  =  {M,M)a. 

Let  us  postpone  the  proof  of  the  lemma.  Since  (M,  M)  Tr  =  (M,  M )  Tr+  for  every 
r  >  0,  Lemma  5.14  implies  that  MZr  =  MZr+ ,  for  every  r  >  0,  a.s.  Hence  the  sample 
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paths  of  /3  are  continuous  (to  be  precise,  we  should  redefine  /3r  —  0,  for  every  r  >  0, 
on  the  zero  probability  set  where  the  property  of  Lemma  5.14  fails). 

Let  us  verify  that  /3S  and  fi2  —  s  are  martingales  with  respect  to  the  filtration 
(&s) .  For  every  integer  n  >  1 ,  the  stopped  continuous  local  martingales  MTn  and 
( MTn )2  —  (M,  M)Tn  are  uniformly  integrable  martingales  (by  Theorem  4. 13,  recalling 
that  Mo  —  0  and  noting  that  (MTn ,  MZn ) oo  =  (M,M)In  —  n  a.s.).  The  optional 
stopping  theorem  (Theorem  3.22)  then  implies  that,  for  every  0  <  r  <  s  <  n. 


E\P,  |  Sfr]  =  mi  I  =  Ml  =  P 


and  similarly 


E[p2-s  |  &r]  =  E[(Ml)2-(MTn,MTn)zs  I  & rr ]  =  (Ml)2  -  (MXn ,  MXn)  xr  =  Pl~r 


Then  the  case  d  —  1  of  Theorem  5.12  shows  that  /3  is  a  (£fr) -Brownian  motion. 
Finally,  by  the  definition  of  /3,  we  have  a.s.  for  every  t  >  0, 


=  M 


But  since  r<MM>t  —  t  —  T<M,M>t+  an<^  since  (M >  takes  the  same  value  at  r<MM>t 
and  at  Lemma  5.14  shows  that  Mt  —  MT<MM>t  for  every  t  >  0,  a.s.  We 

conclude  that  we  have  Mt  —  P<MM>t  f°r  every  t  >  0,  a.s.  This  completes  the  proof 
when  Mo  —  0. 

If  Mo  ^  0,  we  write  Mt  —  Mo  +  M',  and  we  apply  the  previous  argument  to  M' , 
in  order  to  get  a  Brownian  motion  /F  with  /3'0  =  0,  such  that  M't  —  /3f<M,  Mf>  for 
every  t  >  0  a.s.  Since  j3f  is  a  (^r)-Brownian  motion,  ft  is  independent  of  %  = 
hence  of  M0.  Therefore,  /3S  =  Mo  +  /3's  is  also  a  Brownian  motion,  and  we  get  the 
desired  representation  for  M.  □ 

Proof  of  Lemma  5.14  Thanks  to  the  continuity  of  sample  paths  of  M  and  (M,  M), 
it  is  enough  to  verify  that  for  any  fixed  a  and  b  such  that  0  <  a  <  b,  we  have 

{Mt  =  Ma,  Vt  e  [a,b]}  =  {{M,  M)b  =  (M,  M)a}  ,  a.s. 

The  fact  that  the  event  in  the  left-hand  side  is  (a.s.)  contained  in  the  event  in  the 
right-hand  side  is  easy  from  the  approximations  of  (M,  M)  in  Theorem  4.9. 

Let  us  prove  the  converse.  Consider  the  continuous  local  martingale  Nt  —  Mt  — 
MtAa  and  note  that 


(N,N)t=  {M,M)t-{M,M)tAa. 
For  every  s  >  0,  introduce  the  stopping  time 

Ts  —  inf{t  >  0  :  (N,N)t  >  s}. 
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Then  NTs  is  a  martingale  in  H2  (since  {NTs  ,NTs)00  <  s).  Fix  t  e  [a,  b\.  We  have 

E[N?aT ■]  =  £[(^,  A0,Ar.]  <  e. 

Hence,  considering  the  event  A  :  =  {(M,M)b  =  ( M,M)a }  C  {Ts  >  b }, 

£[U^Vf2]  =  E[lAN?ATe]  <  £[<rJ  <  e. 

By  letting  s  go  to  0,  we  get  E[1aN^ ]  =  0  and  thus  Nt  =  0  a.s.  on  A,  which  completes 
the  proof.  □ 

We  can  combine  the  arguments  of  the  proof  of  Theorem  5.13  with  Theorem  5.12 
to  get  the  following  technical  result,  which  will  be  useful  when  we  consider  the 
image  of  planar  Brownian  motion  under  holomorphic  transformations  in  Chap.  7. 

Proposition  5. 15  Let  M  and  N  be  two  continuous  local  martingales  such  that  Mq  = 
No  =  0.  Assume  that 

(i)  ( M,M)t  —  (N,N)t  for  every  t  >  0,  a.s. 

(ii)  M  and  N  are  orthogonal  ( ( M ,  N)t  —  0  for  every  t  >  0,  a.s. ) 

(iii)  (M,M)  oo  =  (W  AOoo  =  oo, 

P  =  (pt)t> o,  resp.  y  =  o,  be  the  real  Brownian  motion  such  Mt  —  f$(MM)t> 
resp.  Nt  —  Y{N,N)t>for  every  t  >  0,  a.i1.  f)  and  y  are  independent. 

Proof  We  use  the  notation  of  the  proof  of  Theorem  5.13  and  note  that  we  have 
Pr  —  MZr  and  yr  =  Aftr,  where 

tr  =  inf{t  >  0  :  (M,M)t  >  r)  =  inf{f  >  0  :  (2V, 2V)f  >  r}. 

We  know  that  f)  and  y  are  (£fr) -Brownian  motions.  Since  M  and  are  orthogonal 
martingales,  we  also  know  that  MtNt  is  a  local  martingale.  As  in  the  proof  of 
Theorem  5.13,  and  using  now  Proposition  4.15  (v),  we  get  that,  for  every  n  >  1, 
MfnNfn  is  a  uniformly  integrable  martingale,  and  by  applying  the  optional  stopping 
theorem,  we  obtain  that  for  r  <  s  <  n. 


EWsYs  I  %\  =  £[MI'W 


^77 

Ts 


so  that  /3ryr  is  a  (^r) -martingale  and  the  bracket  (/3,  y)  (evaluated  in  the  filtration 
(£fr))  is  identically  zero.  By  Theorem  5.12,  it  follows  that  (/3,  y)  is  a  two- 
dimensional  Brownian  motion  and,  since  /3o  =  yo  =  0,  this  implies  that  /3  and 
y  are  independent.  □ 
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5.3.3  The  Burkholder-Davis-Gundy  Inequalities 

We  now  state  important  inequalities  connecting  a  continuous  local  martingale  with 
its  quadratic  variation.  If  M  is  a  continuous  local  martingale,  we  set 

M*  —  sup  \MS\ 

s<t 


for  every  t  >  0.  Theorem  5.16  below  shows  that,  under  the  condition  Mo  =  0, 
for  every  p  >  0,  the  p- th  moment  of  M*  is  bounded  above  and  below  (up  to 

universal  multiplicative  constants)  by  the  p- th  moment  of  y/ ( M ,  M)t.  These  bounds 
are  very  useful  because,  in  particular  when  M  is  a  stochastic  integral,  it  is  often 
easier  to  estimate  the  moments  of  y/(M,  M)t  than  those  of  M* .  Such  applications 
arise,  for  instance,  in  the  study  of  stochastic  differential  equations  (see  e.g.  the  proof 
of  Theorem  8.5  below). 

Theorem  5.16  (Burkholder-Davis-Gundy  inequalities)  For  every  real  p  >  0, 
there  exist  two  constants  cp,  Cp  >  0  depending  only  on  p  such  that,  for  every 
continuous  local  martingale  M  with  Mq  =  0,  and  every  stopping  time  T, 

cp  E[(M,  M)Pj2]  <  E[{M*)P]  <  CpE[(M,M)pT/2]. 

Remark  It  may  happen  that  the  quantities  E[(M, M)PT  2]  and  are  infinite. 

The  theorem  says  that  these  quantities  are  either  both  finite  (then  the  stated  bounds 
hold)  or  both  infinite. 

Proof  Replacing  M  by  the  stopping  martingale  Mr,  we  see  that  it  is  enough  to  treat 
the  special  case  T  —  oo.  We  then  observe  that  it  suffices  to  consider  the  case  when 
M  is  bounded:  Assuming  that  the  bounded  case  has  been  treated,  we  can  replace  M 
by  Mt,\  where  Tn  —  inf{t  >  0  :  \Mt\  —  n },  and  we  get  the  general  case  by  letting  n 
tend  to  oo. 

The  left-hand  side  inequality,  in  the  case  p  >  4,  follows  from  the  result  of 
question  4.  in  Exercise  4.26.  We  prove  below  the  right-hand  side  inequality  for  all 
values  of  p.  This  is  the  inequality  we  will  use  in  the  sequel  (we  refer  to  [70,  Chapter 
IV]  for  the  remaining  case). 

We  first  consider  the  case  p  >  2.  We  apply  Ito’s  formula  to  the  function  \x\p : 

\M,\p=  l  p\Msrlsgn(Ms)dMs+1-  [  p(p  —  l)\Ms\p~2  d(M,M)s. 

Jo  ^  Jo 

Since  M  is  bounded,  hence  in  particular  M  e  H2,  the  process 

t 

p\Msr'sgn(Ms)dMs 
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is  a  martingale  in  H2.  We  therefore  get 


E[\Mt\P]  =  f  \MS\P~2  d(M,  M) 

'-Jo 


< 


< 


2 

P(P~  1) 
2 

Pip  -  1) 


E[(My~2(M,M)t\ 


(E[(My])^~2)/p(E[(M,  M)pt/2])2/p , 


by  Holder’s  inequality.  On  the  other  hand,  by  Doob’s  inequality  in  Lp  (Proposi¬ 
tion  3.15), 


E[(M*f]  <  (-P—J E[\Mt\P] 


and  combining  this  bound  with  the  previous  one,  we  arrive  at 


Em;y]  <  ((^-j-)P  p(p2  1})P/2£[(M,Mf; 


It  now  suffices  to  let  t  tend  to  oo. 

Consider  then  the  case  p  <  2.  Since  M  g  H2,  M 2  —  ( M ,  M)  is  a  uniformly 
integrable  martingale  and  we  have,  for  every  stopping  time  T , 


£[(Mr)2]  =  E[{M,M)t]- 


Let  x  >  0  and  consider  the  stopping  time  Tx  :=  inf{f  >  0  :  ( M ,)2  >  x}.  Then,  if  T 
is  any  bounded  stopping  time, 


>  x)  =  P{TX  <  T)  =  P((MTxat)2  >  x)  <  -E{(MTxAT)2] 

X 

=  -E[{M,M)txat] 

X 

<  -E[{M,M)t\. 
x 

Next  consider  the  stopping  time  Sx  :=  inf{t  >  0  :  ( M,M)t  >  x}.  Observe  that, 
for  every  t  >  0,  we  have  {(M*)2  >  v}  C  ({(^|vAr)2  >  x}  U  {Sx  <  f}).  Using  the 
preceding  bound  with  T  =  Sx  At,  we  thus  get 

P((Mf*)2  >  A-)  <  P((Ml  J-  >  x)  +  P(5X  <  0 

<  U[(M,M)StA(]  +  P((M,  M)t  >  x) 
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=  -E[(M,  M)t  A  x]  +  P((M,  M)t  >  x) 
x 

—  ~E[(M,M)t  l{{M,M)t<x}\  +  2 P((M,M)t  >  v). 

JC 

To  complete  the  proof,  set  <7  =  p/2  e  (0, 1)  and  integrate  each  side  of  the  last  bound 
with  respect  to  the  measure  qxq~l  dx.  We  have  first 


/ 


00  /.(M,*)2 

*\2  ^  .a  — <7—1  —  F  /  ^  V?-1  ^ 


P((M* )  >  x)  qxc 


1  dx  =  A  / 
L  7o 


qx 


=  £[(m;  A], 


and  similarly 


/» 00 

2o 


P({M,M)t  >  x)^x9_1  dx  =  E[{M,M)qt] 


^1 


Furthermore, 


dv 


<7 

1  -2 


Summarizing,  we  have  obtained  the  bound 


£[(M;)2?]  <  (2  + 

and  we  just  have  to  let  t  ->  00  to  get  the  desired  result.  □ 

Corollary  5.17  Let  M  be  a  continuous  local  martingale  such  that  Mo  —  0.  77te 
condition 


E[{M,M)X£]  <  00 
implies  that  M  is  a  uniformly  integrable  martingale. 

Proof  By  the  case  p  —  1  of  Theorem  5.16,  the  condition  E[(M,M)  <  00 

implies  that  E[M^\  <  00.  Proposition  4.7  (ii)  then  shows  that  the  continuous  local 
martingale  M,  which  is  dominated  by  the  variable  M^,  is  a  uniformly  integrable 
martingale.  □ 

The  condition  E[(M,  M}^2]  <  00  is  weaker  than  the  condition  £'[(M,M)00]  < 
00,  which  ensures  that  Mel2.  The  corollary  can  be  applied  to  stochastic  integrals. 
If  M  is  a  continuous  local  martingale  and  H  is  a  progressive  process  such  that,  for 
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every  t  >  0, 


E 


<  OO  , 


then  Jq  HsdMs  is  a  martingale,  and  formulas  (5.6)  and  (5.9)  for  the  first  moment  and 
the  conditional  expectations  of  f^HsdMs  hold  (of  course  with  t  <  oo). 


5.4  The  Representation  of  Martingales  as  Stochastic 
Integrals 

In  the  special  setting  where  the  filtration  on  Q  is  the  completed  canonical  filtration 
of  a  Brownian  motion,  we  will  now  show  that  all  martingales  can  be  represented  as 
stochastic  integrals  with  respect  to  that  Brownian  motion.  For  the  sake  of  simplicity, 
we  first  consider  a  one-dimensional  Brownian  motion,  but  we  will  discuss  the 
extension  to  Brownian  motion  in  higher  dimensions  at  the  end  of  this  section. 

Theorem  5.18  Assume  that  the  filtration  (jZi)  on  Q  is  the  completed  canonical 
filtration  of  a  real  Brownian  motion  B  started  from  0.  Then,  for  every  random 
variable  Z  G  L2(f2,  ^oo  ,P),  there  exists  a  unique  progressive  process  h  G  L2(B) 
(i.e.  E[J^°  h2ds]  <  oo)  such  that 


hs  d Bs. 


Consequently,  for  every  martingale  M  that  is  bounded  in  L 2  ( respectively,  for  every 
continuous  local  martingale  M),  there  exists  a  unique  process  h  G  L2(B)  (resp. 
h  G  L2oc  ( B ) )  and  a  constant  CgK  such  that 


Mt  —  C  +  f  hs  d  Bs . 

Jo 

Remark  As  the  proof  will  show,  the  second  part  of  the  statement  applies  to  a 
martingale  M  that  is  bounded  in  L2,  without  any  assumption  on  the  continuity 
of  sample  paths  of  M.  This  observation  will  be  useful  later  when  we  discuss 
consequences  of  the  representation  theorem.  Note  that  continuous  local  martingales 
have  continuous  sample  paths  by  definition. 

Lemma  5.19  Under  the  assumptions  of  the  theorem,  the  vector  space  generated  by 
the  random  variables 


exp 


n 


j=  1 
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for  any  choice  ofO  —  t^K  t\  <  •  •  •  <  tn  and  A  i , . . . ,  An  G  R,  is  dense  in  the  space 
L2c(^  5  Z* oo>  P)  q/*  rz//  square-integrable  complex~v  alued  QQ~measnrable  random 
variables. 

Proof  It  is  enough  to  prove  that,  if  Z  G  (£2 ,  jZqq  ,  P)  is  such  that 


E 


Z  exp 


(5.17) 


for  any  choice  of  0  =  to  <  t\  <  •  •  •  <  tn  and  X\ , . . . ,  Xn  el,  then  Z  =  0. 

Fix  0  =  to  <  t\  <  •  •  •  <  tn,  and  consider  the  complex  measure  /i  on  W1  defined 
by 


li(F)  =  E  ZlF(Bh,Bt2  -B 


t\  > 


for  any  Borel  subset  F  of  R”.  Then  (5.17)  exactly  shows  that  the  Fourier  transform 
of  /x  is  identically  zero.  By  the  injectivity  of  the  Fourier  transform  on  complex 
measures  on  W1,  it  follows  that  fi  —  0.  We  have  thus  E[Z  1  a]  =  0  for  every  A  e 
a(Btl,. . .  ,BtJ. 

A  monotone  class  argument  then  shows  that  the  identity  E[Z  1a]  =  0  remains 
valid  for  any  A  e  cr(Bt,t  >  0),  and  then  by  completion  for  any  A  e  ^oo-  It  follows 
that  Z  =  0.  □ 

Proof  of  Theorem  5.18  We  start  with  the  first  assertion.  We  first  observe  that  the 
uniqueness  of  h  is  easy  since,  if  the  representation  of  a  given  variable  Z  holds  with 
two  processes  h  and  h'  in  L2(B ),  we  have 


hence  h  —  h!  in  L2(B). 

Let  us  turn  to  the  existence  part.  Let  J ZX  stand  for  the  vector  space  of  all  variables 
Z  G  E2(£2  ,  X^oo,  P )  for  which  the  property  of  the  statement  holds.  We  note  that  if 
Z  G  J 'if  and  h  is  the  associated  process  in  L2(B ),  we  have 


E[Z2]  =  (E[Z])2  +  E 


It  follows  that  is  a  closed  subspace  of  L2(i? ,  J^x),  P).  Indeed,  if  (Zn)  is  a 
sequence  in  Jrf?  that  converges  to  Z  in  L2(£2 ,  jZoo,  P),  the  processes  h^n)  corre¬ 
sponding  respectively  to  the  variables  Zn  form  a  Cauchy  sequence  in  L2(B ),  hence 
converge  in  L 2  (B)  to  a  certain  process  h  G  L2  ( B )  -  here  we  use  the  Hilbert  space 
structure  of  L2{B)  -  and  it  immediately  follows  that  Z  =  E[Z\  +  /0°°  hs  d Bs. 

Since  XZZ  is  closed,  in  order  to  prove  that  XZZ  —  L2(£2,  XPqq,  P),  we  just  have  to 
verify  that  Jt?  contains  a  dense  subset  of  L2(£2,  AZoq  ,  P) .  Let  0  =  to  <  t\  <  •  •  •  <  tn 
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and  X\, ...  ,Xn  e  M,  and  set  f(s)  =  Yll=i  Write  Sf  for  the  exponential 

j  j  j 

martingale  <g(i  f0f(s )  dBs)  (cf.  Proposition  5.11).  Proposition  5.11  shows  that 


exp 


4f(s)  d B, 


and  it  follows  that  both  the  real  part  and  the  imaginary  part  of  variables  of  the  form 

exP  (i  J2'j= i  ))  are  in  jf?.  By  Lemma  5.19,  linear  combinations  of  such 

random  variables  are  dense  in  L2(£?,  J^oo ,  P) .  This  completes  the  proof  of  the  first 
assertion  of  the  theorem. 

Let  us  turn  to  the  second  assertion.  If  M  is  a  martingale  that  is  bounded  in  L2, 
then  Mqo  £  L2  (£2,  jZqq,  P),  and  thus  can  be  written  in  the  form 


hs  d Bs, 


where  h  e  L2(B).  Thanks  to  (5.9),  it  follows  that 


Mt  =  L[Moo 


£[Moo]  +  f  hs  d Bs 
Jo 


and  the  uniqueness  of  h  is  also  immediate  from  the  uniqueness  in  the  first  assertion. 

Finally,  if  M  is  a  continuous  local  martingale,  we  have  first  Mo  =  C  e  R  because 
the  a-field  contains  only  events  of  probability  zero  or  one.  If  Tn  —  inf{t  >  0  : 
\Mt\  >  n}  we  can  apply  the  case  of  martingales  bounded  in  L 2  to  MTn  and  we  get  a 
process  h ^  e  L 2  (B)  such  that 


=  C  + 


h™  dBs. 


Using  the  uniqueness  of  the  progressive  process  in  the  representation,  we  get  that 
him)  —  l[o, Tm\(s)  hsn)  if  m  <  n,  ds  a.e.,  a.s.  It  is  now  easy  to  construct  a  process 

h  e  L2oc(B )  such  that,  for  every  m,  hi’n)  —  l[o,rm]  (s)  hs ,  d.s’  a.e.,  a.s.  The  representation 
formula  of  the  theorem  follows,  and  the  uniqueness  of  h  is  also  straightforward.  □ 

Consequences  Let  us  give  two  important  consequences  of  the  representation 
theorem.  Under  the  assumptions  of  the  theorem: 

(1)  The  filtration  o  is  right-continuous.  Indeed,  let  t  >  0  and  let  Z  be  im¬ 
measurable  and  bounded.  We  can  find  h  e  L2(B)  such  that 


hsdBs. 
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If  s  >  0,  Z  is  J^t+e- measurable,  and  thus,  using  (5.9), 


Z  =  E[Z  |  #)+e] 


hsdBs. 


When  £  0  the  right-hand  side  converges  in  L 2  to 


E\Z\  +  f  hsdBs. 

Jo 

Thus  Z  is  equal  a.s.  to  an  ^-measurable  random  variable,  and,  since  the 
filtration  is  complete,  Z  is  ^-measurable. 

A  similar  argument  shows  that  the  filtration  o  is  also  left-continuous: 

If,  for  t  >  0,  we  let 


=  v  j? 

se[0,t) 

be  the  smallest  a -field  that  contains  all  a -fields  for  s  G  [0,0,  we  have 

(2)  All  martingales  of  the  filtration  o  have  a  modification  with  continu¬ 
ous  sample  paths.  For  a  martingale  that  is  bounded  in  L2,  this  follows  from  the 
representation  formula  (see  the  remark  after  the  statement  of  the  theorem).  Then 
consider  a  uniformly  integrable  martingale  M  (if  M  is  not  uniformly  integrable, 
we  just  replace  M  by  MtAa  for  every  a  >  0).  In  that  case,  we  have,  for  every 
t  >  0, 


Mt  =  E[M0 o 


By  Theorem  3.18  (whose  application  is  justified  as  we  know  that  the  filtration  is 
right-continuous),  the  process  Mt  has  a  modification  with  cadlag  sample  paths, 

(n) 

and  we  consider  this  modification.  Let  M0 0  be  a  sequence  of  bounded  random 
variables  such  that  M&  — >  M ^  in  L1  as  n  ->  oo.  Introduce  the  martingales 


which  are  bounded  in  L2.  By  the  beginning  of  the  argument,  we  can  assume 
that,  for  every  n ,  the  sample  paths  of  M ^  are  continuous.  On  the  other  hand, 
Doob’s  maximal  inequality  (Proposition  3.15)  implies  that,  for  every  A  >  0, 


sup  | M\n)  —  Mt |  >  A 

f>0 
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It  follows  that  we  can  find  a  sequence  ^  f  o o  such  that,  for  every  k  >  1 , 


P  sup  I M(;,k)  -  M, I  >  2~k  <  2~k. 


L  t>  0 


An  application  of  the  Borel-Cantelli  lemma  now  shows  that 


sup  —  M,\  0 

k  >oo 


and  we  get  that  the  sample  paths  of  M  are  continuous  as  uniform  limits  of 
continuous  functions. 

Multidimensional  extension  Let  us  briefly  describe  the  multidimensional  exten¬ 
sion  of  the  preceding  results.  We  now  assume  that  the  filtration  (J^)  on  Q  is 
the  completed  canonical  filtration  of  a  d-dimensional  Brownian  motion  B  — 
( Bl , . . . ,  Bd )  started  from  0.  Then,  for  every  random  variable  Z  e  L2(X2,  ^oo,  P), 
there  exists  a  unique  d-tuple  (hl, . . . ,  hd )  of  progressive  processes,  satisfying 


oo 


(hls)2  ds  <  oo  ,  V/  G  {1, . . . ,  d}, 


E 


such  that 


Z  =  E[Z]  + 


Similarly,  if  M  is  a  continuous  local  martingale,  there  exist  a  constant  C  and  a  unique 


d-tuple  (hl , . . . ,  hd )  of  progressive  processes,  satisfying 


Vt  >  0,  Vi  G  {1, . . . ,  d}, 


such  that 


The  proofs  are  exactly  the  same  as  in  the  case  d  =  1  (Theorem  5.18).  Consequences 
(1)  and  (2)  above  remain  valid. 
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5.5  Girsanov’s  Theorem 

Throughout  this  section,  we  assume  that  the  filtration  (J^)  is  both  complete  and 
right-continuous.  Our  goal  is  to  study  how  the  notions  of  a  martingale  and  of  a 
semimartingale  are  affected  when  the  underlying  probability  measure  P  is  replaced 
by  another  probability  measure  Q .  Most  of  the  time  we  will  assume  that  P  and  Q  are 
mutually  absolutely  continuous,  and  then  the  fact  that  the  filtration  \ )  is  complete 
with  respect  to  P  implies  that  it  is  complete  with  respect  to  Q.  When  there  is  a  risk  of 
confusion,  we  will  write  EP  for  the  expectation  under  the  probability  measure  P ,  and 
similarly  Eq  for  the  expectation  under  Q.  Unless  otherwise  specified,  the  notions  of  a 
(local)  martingale  or  of  a  semimartingale  refer  to  the  underlying  probability  measure 
P  (when  we  consider  these  notions  under  Q  we  will  say  so  explicitly).  Note  that,  in 
contrast  with  the  notion  of  a  martingale,  the  notion  of  a  finite  variation  process  does 
not  depend  on  the  underlying  probability  measure. 

Proposition  5.20  Assume  that  Q  is  a  probability  measure  on  (£2,£F),  which  is 
absolutely  continuous  with  respect  to  P  on  the  o -field  For  every  t  G  [0,  oo],  let 


dP\&t 

be  the  Radon-Nikodym  derivative  of  Q  with  respect  to  P  on  the  o -field  £Pt.  The 
process  (Dt)t> o  is  a  uniformly  integrable  martingale.  Consequently  (Dt)t> o  has  a 
cadlag  modification.  Keeping  the  same  notation  {Dt)t> o  for  this  modification,  we 
have,  for  every  stopping  time  T, 


d Q 

dP\^f 


Finally,  if  we  assume  furthermore  that  P  and  Q  are  mutually  absolutely  continuous 
on  AEoo,  we  have 


inf  Dt  >  0  ,  P  a.s. 

t>  o 


Proof  If  A  G  we  have 


Q(A)  —  Eq  [1a]  —  EpIIaDqq]  —  EpWaEpIDqq 


and,  by  the  uniqueness  of  the  Radon-Nikodym  derivative  on  &u  it  follows  that 


Dt  —  EpID^ 


a.s. 


Hence  D  is  a  uniformly  integrable  martingale,  which  is  closed  by  D 00.  Theorem  3.18 
(using  the  fact  that  (£Ft)  is  both  complete  and  right-continuous)  then  allows  us  to 
find  a  cadlag  modification  of  {Dt)t> o,  which  we  consider  from  now  on. 
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Then,  if  T  is  a  stopping  time,  the  optional  stopping  theorem  (Theorem  3.22)  gives 
for  every  A  e 


Q(A)  =  Eq[  1a]  =  EpIUDoo]  =  EP[lAEP[Doo  |  &T]]  =  Ep[1aDt], 


and,  since  Dp  is  -measurable,  it  follows  that 


dP\^T' 

Let  us  prove  the  last  assertion.  For  every  s  >  0,  set 

Ts  —  inf{t  >  0  :  Dt  <  s} 

and  note  that  Ts  is  a  stopping  time  as  the  first  hitting  time  of  an  open  set  by  a  cadlag 
process  (recall  Proposition  3.9  and  the  fact  that  the  filtration  is  right-continuous). 
Then,  noting  that  the  event  {Ts  <  oo}  is  -measurable, 

Q(Te  <  oo)  =  EP[l{Te<oo}  DTs]  <  e 

since  Dp  <  g  on  {fe  <  oo}  by  the  right-continuity  of  sample  paths.  It  immediately 
follows  that 


oo 

fi(nm/-<oo}) 

n=  1 


and  since  P  is  absolutely  continuous  with  respect  to  Q  we  have  also 

oo 

P(C\{Tl/n  <  OO})  =0. 

n=  1 

But  this  exactly  means  that,  P  a.s.,  there  exists  an  integer  n  >  1  such  that  T\/n  —  oo, 
giving  the  last  assertion  of  the  proposition.  □ 

Proposition  5.21  Let  D  be  a  continuous  local  martingale  taking  ( strictly )  positive 
values.  There  exists  a  unique  continuous  local  martingale  L  such  that 

D,  =  exp  (Lt  -  }(£,£),)  =  S(L)t. 

Moreover,  L  is  given  by  the  formula 

L,  =  logZ)0  +  f  D~' dDs. 

Jo 
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Proof  Uniqueness  is  an  easy  consequence  of  Theorem  4.8.  Then,  since  D  takes 
positive  values,  we  can  apply  Ito’s  formula  to  log  A  (see  the  remark  before 
Proposition  5.1 1),  and  we  get 


log  D, 


1  f'd{D,D)s 

2  Jo 


t  5 


where  L  is  as  in  the  statement.  □ 

We  now  state  the  main  theorem  of  this  section,  which  explains  the  relation 
between  continuous  local  martingales  under  P  and  continuous  local  martingales 
under  Q. 

Theorem  5.22  (Girsanov)  Assume  that  the  probability  measures  P  and  Q  are 
mutually  absolutely  continuous  on  G.  Let  (Dt)t> o  be  the  martingale  with  cadlag 
sample  paths  such  that,  for  every  t  >  0, 


d Q 

dP\&t' 


Assume  that  D  has  continuous  sample  paths,  and  let  L  be  the  unique  continuous 
local  martingale  such  that  Dt  —  (L)r.  Then,  ifM  is  a  continuous  local  martingale 

under  P,  the  process 


M  —  M—  (M,  L) 

is  a  continuous  local  martingale  under  Q. 

Remark  By  consequences  of  the  martingale  representation  theorem  explained  at 
the  end  of  the  previous  section,  the  continuity  assumption  for  the  sample  paths  of 
D  always  holds  when  is  the  (completed)  canonical  filtration  of  a  Brownian 
motion.  In  applications  of  Theorem  5.22,  one  often  starts  from  the  martingale  ( Dt ) 
to  define  the  probability  measure  Q ,  so  that  the  continuity  assumption  is  satisfied  by 
construction  (see  the  examples  in  the  next  section). 

Proof  The  fact  that  Dt  can  be  written  in  the  form  Dt  —  S’ (L)t  follows  from 
Proposition  5.21  (we  are  assuming  that  D  has  continuous  sample  paths,  and  we  also 
know  from  Proposition  5.20  that  D  takes  positive  values).  Then,  let  T  be  a  stopping 
time  and  let  X  be  an  adapted  process  with  continuous  sample  paths.  We  claim  that, 
if  {XD)t  is  a  martingale  under  P,  then  XT  is  a  martingale  under  Q.  Let  us  verify  the 
claim.  By  Proposition  5.20,  Eq[\XTm\\  —  Ep\\XTAtDT/\t\]  <  oo,  and  it  follows  that 
Xj  e  Ll  (Q).  Then,  let  A  G  STs  and  s  <  t.  Since  A  D  {T  >  s}  G  we  have,  using 
the  fact  that  ( XD)T  is  a  martingale  under  P, 

Ep[lAn{T>s}XTAtDTAt\  =  Ep[lAn{T>s}XTAsE>TAs]- 
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By  Proposition  5.20, 


and  thus,  since  A  D  {T  >  s}  e  J^tas  C  ^tau  it  follows  that 


E()[lAr){T>s}XTAt]  —  Eq[  lAn{T>s}XTAs\- 


On  the  other  hand,  it  is  immediate  that 


E()[lAn{T<s}XTAt]  —  Eq[1AC]{T<s}XTAs]. 


By  combining  with  the  previous  display,  we  have  Eq  [1  AXTAt\  —  Eq  [1aXTAs]  ,  giving 
our  claim.  As  a  consequence  of  the  claim,  we  get  that,  if  XD  is  a  continuous  local 
martingale  under  P,  then  X  is  a  continuous  local  martingale  under  Q. 

Next  let  M  be  a  continuous  local  martingale  under  P,  and  let  M  be  as  in  the 
statement  of  the  theorem.  We  apply  the  preceding  observation  to  X  =  M,  noting 
that,  by  Ito’s  formula, 


since  d (M,  L)s  —  Ds  ld (M,  D)s  by  Proposition  5.21.  We  get  that  MD  is  a  continuous 
local  martingale  under  P,  and  thus  M  is  a  continuous  local  martingale  under  Q.  □ 

Consequences 

(a)  A  process  M  which  is  a  continuous  local  martingale  under  P  remains  a 
semimartingale  under  Q ,  and  its  canonical  decomposition  under  Q  is  M  = 
M+  (M,  L)  (recall  that  the  notion  of  a  finite  variation  process  does  not  depend  on 
the  underlying  probability  measure).  It  follows  that  the  class  of  semimartingales 
under  P  is  contained  in  the  class  of  semimartingales  under  Q. 

In  fact  these  two  classes  are  equal.  Indeed,  under  the  assumptions  of 
Theorem  5.22,  P  and  Q  play  symmetric  roles,  since  the  Radon-Nikodym 
derivative  of  P  with  respect  to  Q  on  the  a -field  is  D~l ,  which  has  continuous 
sample  paths  if  D  does. 

We  may  furthermore  notice  that 


D,  1  =  exp  (  —  L,  +  (L,  L),  -  1  (L,  L)()  =  exp  (  —  Lt  —  1  (L,  L)t)  =  S (~L)„ 
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where  L  =  L  —  (L,  L)  is  a  continuous  local  martingale  under  Q ,  and  (L,  L)  = 
(L,  L).  So,  under  the  assumptions  of  Theorem  5.22,  the  roles  of  P  and  Q  can  be 
interchanged  provided  D  is  replaced  by  D~{  and  L  is  replaced  by  —  L. 

(b)  Let  X  and  Y  be  two  semimartingales  (under  P  or  under  Q ).  The  bracket  ( X ,  Y) 
is  the  same  under  P  and  under  Q.  In  fact  this  bracket  is  given  in  both  cases  by 
the  approximation  of  Proposition  4.21  (this  observation  was  used  implicitly  in 
(a)  above). 

Similarly,  if  H  is  a  locally  bounded  progressive  process,  the  stochastic 
integral  H  •  X  is  the  same  under  P  and  under  Q.  To  see  this  it  is  enough  to 
consider  the  case  when  X  —  M  is  a  continuous  local  martingale  (under  P). 
Write  (H  •  M)P  for  the  stochastic  integral  under  P  and  (H  •  M)q  for  the  one 
under  Q.  By  linearity, 


C H  •  M)P  =  ( H  •  M)p  -  H  •  (Af,  L)  =  (H  •  M)P  -  ((//  •  M)P,  L), 


and  Theorem  5.22  shows  that  ( H-M)P  is  a  continuous  local  martingale  under  Q. 
Furthermore  the  bracket  of  this  continuous  local  martingale  with  any  continuous 
local  martingale  N  under  Q  is  equal  to  H  •  (Af ,  N)  —  H  •  (Af ,  Af ) ,  and  it  follows 
from  Theorem  5.6  that  ( H  •  M)P  —  (H  •  M)q  hence  also  (H  •  M)P  —  (H  •  M)q. 

With  the  notation  of  Theorem  5.22,  set  M  —  ^ q(M ).  Then  maps  the 
set  of  all  P-continuous  local  martingales  onto  the  set  of  all  Q-continuous  local 
martingales.  One  easily  verifies,  using  the  remarks  in  (a)  above,  that  @p  o  ^  = 
Id.  Furthermore,  the  mapping  commutes  with  the  stochastic  integral:  if  H  is 
a  locally  bounded  progressive  process,  H  •  (^0  —  *  Af). 

(c)  Suppose  that  M  —  B  is  an  (J^)- -Brownian  motion  under  P,  then  P  =  B  —  (B,L) 
is  a  continuous  local  martingale  under  Q ,  with  quadratic  variation  {B,B)t  = 
( B,B)t  =  t.  By  Theorem  5.12,  it  follows  that  B  is  an  (J^)-Brownian  motion 
under  Q. 

In  most  applications  of  Girsanov’s  theorem,  one  constructs  the  probability 
measure  Q  in  the  following  way.  Start  from  a  continuous  local  martingale  L  such 
that  L0  =  0  and  (L,  L)^  <  o o  a.s.  The  latter  condition  implies  that  the  limit  L0 0  :  = 
lim^ooZ^  exists  a.s.  (see  Exercise  4.24).  Then  <£(L)t  is  a  nonnegative  continuous 
local  martingale  hence  a  supermartingale  (Proposition  4.7),  which  converges  a.s. 
to  <f(L) oo  =  exp(Loo  —  ^(L,  L)oo),  and  P[<f(L)oo]  <  1  by  Fatou’s  lemma.  If  the 
property 


W£)oo]  =  1  (5.18) 

holds,  then  S’iL)  is  a  uniformly  integrable  martingale  (by  Fatou’s  lemma  again,  one 
has  <£(L)t  >  ^[^(L)^  |  #)],  but  (5.18)  implies  that  E[$(L) oo]  =  P[^(L)0]  = 
P[^(L)J  for  every  ^  >  0).  If  we  let  Q  be  the  probability  measure  with  density 
S’iL )oo  with  respect  to  P,  we  are  in  the  setting  of  Theorem  5.22,  with  Dt  =  (L)f. 

It  is  therefore  very  important  to  give  conditions  that  ensure  that  (5.18)  holds. 
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Theorem  5.23  Let  Lbe  a  continuous  local  martingale  such  that  Lo  =  0.  Consider 
the  following  properties: 

(i)  £[exp  |(L,  Z^oo]  <  oo  (Novikov’s  criterion); 

(ii)  L  is  a  uniformly  integrable  martingale,  and  £[exp  ^ L oo]  <  oo  (Kazamaki’s 
criterion ); 

(iii)  S (L)  is  a  uniformly  integrable  martingale. 

Then,  (i)  =4>  (ii)  =4>  (iii). 

Proof  (i)  =4>  (ii)  Property  (i)  implies  that  E[(L,  L)^]  <  oo  hence  also  that  L  is  a 
continuous  martingale  bounded  in  L 2  (Theorem  4.13).  Then, 

exp^Loo  =  (^(Ooo)1/2  (exp(i(L,L)oo))1/2 
so  that,  by  the  Cauchy-Schwarz  inequality, 

flexplioo]  <  {E[S{L)0^yl\E[txS^{L,L)00)]yi2 

<  (EfcxpifL^)^)])1/2  <  oo. 

(ii)  (iii)  Since  L  is  a  uniformly  integrable  martingale,  Theorem  3.22  shows 
that,  for  any  stopping  time  T ,  we  have  LT  —  E[L ^  |  &T].  Jensen’s  inequality  then 
gives 


exp  -Lt  <  E[ex p  -Loo  |  &T\. 

By  assumption,  L[ex p  \L0 c]  <  oo,  which  implies  that  the  collection  of  all  variables 
of  the  form  L[exp  ^L0 0  |  for  any  stopping  time  T,  is  uniformly  integrable. 

The  preceding  bound  then  shows  that  the  collection  of  all  variables  exp  \Lt,  for  any 
stopping  time  T,  is  also  uniformly  integrable. 

For  0  <  a  <  1,  set  z\a)  —  exp(y^).  Then,  one  easily  verifies  that 

gw,  =  {£(L)ty2  c 4a)y~a2 . 

If  r  e  SZ  and  T  is  a  stopping  time,  Holder’s  inequality  gives 

E[lrS’(aL)T]  <  E[S’(L)Tf  E[irZyy-a2  <  E[lrZ <  E[lr  exp  ]-LT]Ml~a) . 

In  the  second  inequality,  we  used  the  property  E[S(L)T\  <  1,  which  holds  by 
Proposition  3.25  because  S(L)  is  a  nonnegative  supermartingale  and  S(L)q  —  1. 
In  the  third  inequality,  we  use  Jensen’s  inequality,  noting  that  >  1.  Since  the 
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collection  of  all  variables  of  the  form  exp  \ LT ,  for  any  stopping  time  T,  is  uniformly 
integrable,  the  preceding  display  shows  that  so  is  the  collection  of  all  variables 
£ ( aL)T  for  any  stopping  time  T.  By  the  definition  of  a  continuous  local  martingale, 
there  is  an  increasing  sequence  Tn  \  oo  of  stopping  times,  such  that,  for  every  n , 
£{aL)tATn  is  a  martingale.  If  0  <  s  <  t,  we  can  use  uniform  integrability  to  pass  to 
the  limit  n  ->  oo  in  the  equality  E[£ ( aL)tATn  \  ^s\  —  £ (< aL)sATn  and  we  get  that 
£ ( aL )  is  a  uniformly  integrable  martingale.  It  follows  that 

1  -  ElS'iaL)^]  <  ElgiLUfElZ^]1-*2  <  E\S (LM^exp 

using  again  Jensen’s  inequality  as  above.  When  a  ->  1,  this  gives  E[£(L) oo]  >  1 
henc  e£[<f(L)oo]  =  1.  □ 


5.6  A  Few  Applications  of  Girsanov’s  Theorem 


In  this  section,  we  describe  a  few  applications  of  Girsanov’s  theorem,  which 
illustrate  the  strength  of  the  previous  results. 

Constructing  solutions  of  stochastic  differential  equations  Let  b  be  a  bounded 
measurable  function  on  M+  x  R.  We  assume  that  there  exists  a  function  g  e 
L2(M+,  «^(R_|_),  dO  such  that  |Z?(r,  x)|  <  g(0  for  every  (7,  v)  e  M+  x  R.  This  holds 
in  particular  if  there  exists  an  A  >  0  such  that  \b\  is  bounded  on  [0,  A]  x  M+  and 
vanishes  on  (A,  oo)  x  R+. 

Let  B  be  an  (J^) -Brownian  motion.  Consider  the  continuous  local  martingale 

Lt  =  f  b(s,  Bs)  dBs 

Jo 

and  the  associated  exponential  martingale 

Dt  —  £  (L)t  =  exp  ^  y  d5,  J  b(s>  ^)2(i^)  • 

Our  assumption  on  b  ensures  that  condition  (i)  of  Theorem  5.23  holds,  and  thus  D 
is  a  uniformly  integrable  martingale.  We  set  Q  —  D0 0  •  P.  Girsanov’s  theorem,  and 
remark  (c)  following  the  statement  of  this  theorem,  show  that  the  process 


ds 


is  an  (^t) -Brownian  motion  under  Q. 
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We  can  restate  the  latter  property  by  saying  that,  under  the  probability  measure 
Q ,  there  exists  an  (J^)) -Brownian  motion  /3  such  that  the  process  X  —  B  solves  the 
stochastic  differential  equation 

dXt  =  d pt  +  b(t,Xt)  d t. 


This  equation  is  of  the  type  that  will  be  considered  in  Chap.  7  below,  but  in  contrast 
with  the  statements  of  this  chapter,  we  are  not  making  any  regularity  assumption 
on  the  function  b.  It  is  remarkable  that  Girsanov’s  theorem  allows  one  to  construct 
solutions  of  stochastic  differential  equations  without  regularity  conditions  on  the 
coefficients. 

The  Cameron-Martin  formula  We  now  specialize  the  preceding  discussion  to 
the  case  where  b(t,  x)  does  not  depend  on  v.  We  assume  that  b(t,x)  —  g(t ),  where 
g  e  L2(M+,  <^(R+),  df),  and  we  also  set,  for  every  t  >  0, 

h(t)  =  [  g(s )  ds. 

Jo 

The  set  Jif  of  all  functions  h  that  can  be  written  in  this  form  is  called  the  Cameron- 
Martin  space.  If  h  e  we  sometimes  write  h  —  g  for  the  associated  function  in 
L2(M+,  <^(R+),  d t)  (this  is  the  derivative  of  h  in  the  sense  of  distributions). 

As  a  special  case  of  the  previous  discussion,  under  the  probability  measure 


Q  :=  Doc  -P  =  exp 


•  oo 


g  (s)  d Bs 


the  process  fit  Bt  —  h(t )  is  a  Brownian  motion.  Hence,  for  every  nonnegative 
measurable  function  0  on  C(R+,  R), 


EP[D oo  o)]  =  EQ[0((Bt)t> 0)]  =  EQ[0((fit  +  h(t))t>0)] 

=  EP[<P((Bt  +  h(t))t>  0)]. 


The  equality  between  the  two  ends  of  the  last  display  is  the  Cameron-Martin 
formula.  In  the  next  proposition,  we  write  this  formula  in  the  special  case  of  the 
canonical  construction  of  Brownian  motion  on  the  Wiener  space  (see  the  end  of 
Sect.  2.2). 

Proposition  5.24  (Cameron-Martin  formula)  Let  W(dw)  be  the  Wiener  measure 
on  C(R+,  R),  and  let  h  be  a  function  in  the  Cameron-Martin  space  JXC .  Then,  for 
every  nonnegative  measurable  function  0  on  C(R+,  R), 


/ 


W(dw)  0(w  +  h)  —  /  W(dw)  exp 


■/ 


oo 


h(s)  dw(s) 


0 


h(s )2  d^0(w). 
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Remark  The  integral  /0°°  h(s )  dw(V)  is  a  stochastic  integral  with  respect  to  w(s) 
(which  is  a  Brownian  motion  under  W(dw )),  but  it  can  also  be  viewed  as  a  Wiener 
integral  since  the  function  h(s )  is  deterministic.  The  Cameron-Martin  formula  can 
be  established  by  Gaussian  calculations  that  do  not  involve  stochastic  integrals  or 
Girsanov’s  theorem  (see  e.g.  Chapter  1  of  [62]).  Still  it  is  instructive  to  derive  this 
formula  as  a  special  case  of  Girsanov’s  theorem. 

The  Cameron-Martin  formula  gives  a  “quasi-invariance”  property  of  Wiener 
measure  under  the  translations  by  functions  of  the  Cameron-Martin  space:  The 
image  of  Wiener  measure  W(dw)  under  the  mapping  w^w  +  /i  has  a  density  with 
respect  to  W(dw)  and  this  density  is  the  terminal  value  of  the  exponential  martingale 
associated  with  the  martingale  h(s)dw(s). 

Law  of  hitting  times  for  Brownian  motion  with  drift  Let  B  be  a  real  Brownian 
motion  with  Bo  —  0,  and  for  every  a  >  0,  let  Ta  inf{£  >  0  :  Bt  —  a).  If  c  e  R  is 
given,  we  aim  at  computing  the  law  of  the  stopping  time 

Ua  :=  inf{t  >  0  :  Bt  +  ct  =  a). 

Of  course,  if  c  —  0,  we  have  Ua  =  Ta,  and  the  desired  distribution  is  given  by 
Corollary  2.22.  Girsanov’s  theorem  (or  rather  the  Cameron-Martin  formula)  will 
allow  us  to  derive  the  case  where  c  is  arbitrary  from  the  special  case  c  —  0. 

Fix  t  >  0  and  apply  the  Cameron-Martin  formula  with 

h(s)  =  c  l{s<t}  ,  h(s)  =  c(s  A  t)  , 
and,  for  every  w  e  C(R+,  R), 


0(w)  —  l{max[o,r]  w(s)>a}- 


It  follows  that 


P(Ua  <t)=  E[<P(B  +  h)] 


=  E 


oo 


0(5)  exp  (  j  h(s )  d/iv  —  I  J  h(s )2  ds) 


=  £[1  {Ta<t}  exp (cB,  -  —t))\ 


c 


=  £[1  {Ta<t}  exp (cBtATa  -  —  (t  A  Ta))] 


=  E[l{Ta<t}  exp(ca 
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a 


a 

2s 


V2 


TC  S' 


_ e-  Ys  (' a~cs )2 


where,  in  the  fourth  equality,  we  used  the  optional  stopping  theorem  (Corol¬ 
lary  3.23)  to  write 

c2  c2 

E[exp(cBt  -  —t)  |  JWJ  =  exp (cBtATrl  -  —(t  A  Ta)), 

and  we  also  made  use  of  the  explicit  density  of  Ta  given  in  Corollary  2.22.  This 
calculation  shows  that  the  variable  Ua  has  a  density  on  M+  given  by 


f{s)  = 


a 


V2 


TVS' 


By  integrating  this  density,  we  can  verify  that 


P(Ua  <  oo)  = 


1  if  c  >  0, 
e2ca  if  c  <  0, 


which  may  also  be  checked  more  easily  by  applying  the  optional  stopping  theorem 
to  the  continuous  martingale  exp(— 2 c(Bt  +  ct )). 


Exercises 

In  the  following  exercises,  processes  are  defined  on  a  probability  space  (£2,  &,P) 
equipped  with  a  complete  filtration  (^t)te[  o,oo]> 

Exercise  5.25  Let  B  be  an  -Brownian  motion  with  Z?o  =  0,  and  let  H  be  an 
adapted  process  with  continuous  sample  paths.  Show  that  j-  HsdBs  converges  in 
probability  when  t  0  and  determine  the  limit. 

Exercise  5.26 

1.  Let  B  be  a  one-dimensional  (^i) -Brownian  motion  with  Z?0  =  0-  Let /  be  a  twice 
continuously  differentiable  function  on  R,  and  let  g  be  a  continuous  function  on 
R.  Verify  that  the  process 


Xt  =  f(Bt )  exp 
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is  a  semimartingale,  and  give  its  decomposition  as  the  sum  of  a  continuous  local 
martingale  and  a  finite  variation  process. 

2.  Prove  that  X  is  a  continuous  local  martingale  if  and  only  if  the  function/  satisfies 
the  differential  equation 

f"  =  2gf. 

3.  From  now  on,  we  suppose  in  addition  that  g  is  nonnegative  and  vanishes  outside  a 
compact  subinterval  of  (0,  oo).  Justify  the  existence  and  uniqueness  of  a  solution 
/i  of  the  equation  /"  =  2 gf  such  that/i(0)  =  1  and /((())  =  0.  Let  a  >  0  and 
Ta  —  inf{t  >  0  \  Bt  —  a).  Prove  that 

£[“p  (■/"*<B-)ds)]=^ 

Exercise  5.27  (, Stochastic  calculus  with  the  sup  remum)  Preliminary  question.  Let 
m  :  M+  — >  R  be  a  continuous  function  such  that  m(0)  =  0,  and  let  s  :  R+  — >  R 
be  the  monotone  increasing  function  defined  by 

s(t)  =  sup  m(r). 

0  <r<t 

Show  that,  for  every  bounded  Borel  function  h  on  R  and  every  t  >  0, 

(/(r)  —  m(r))  /z(r)  d^(r)  =  0. 

(One  may  first  observe  that  f  l/(r)  d^(r)  =  0  for  every  open  interval  I  that  does  not 
intersect  {r  >  0  :  s(r)  =  m(r)}.) 

1.  Let  M  be  a  continuous  local  martingale  such  that  Mo  =  0,  and  for  every  t  >  0, 
let 

S*  =  sup  Mr. 

o</-</ 

Let  (p  :  M+  — >  R  be  a  twice  continuously  differentiable  function.  Justify  the 
equality 

XX>  =  ^(0)+  [  <p\ss)ds5. 

Jo 
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2.  Show  that 


(St  -  Mt)  <p(St )  =  <P(St)  -  [  <p(Ss)  d Ms 

Jo 

where  @(x)  —  <p(y)  dy  for  every  v  G  R. 

3.  Infer  that,  for  every  A  >  0, 


e~XSt  +  A  (St-  Mt)e~XSt 


is  a  continuous  local  martingale. 

4.  Let  a  >  0  and  T  —  inf {t  >  0  :  St—Mt  =  a}.  We  assume  that  (M,  M)^  —  oo  a.s. 
Show  that  T  <  oo  a.s.  and  St  is  exponentially  distributed  with  parameter  1/a. 

Exercise  5.28  Let  B  be  an  -Brownian  motion  started  from  1.  We  fix  s  e  (0, 1) 
and  set  Ts  —  inf {t  >  0  :  Bt  —  s}.  We  also  let  A  >  0  and  a  e  M\{0}. 

1.  Show  that  Zt  —  (BtATe)a  is  a  semimartingale  and  give  its  canonical  decomposi¬ 
tion  as  the  sum  of  a  continuous  local  martingale  and  a  finite  variation  process. 

2.  Show  that  the  process 

.  ,  ftATs  ds\ 

Zt  =  (BtATsy  exp  (  -  A  |  —) 

is  a  continuous  local  martingale  if  a  and  A  satisfy  a  polynomial  equation  to  be 
determined. 

3.  Compute 


exp 


Exercise  5.29  Let  (Xt)t>o  be  a  semimartingale.  We  assume  that  there  exists  an 
-Brownian  motion  (Bt)t> o  started  from  0  and  a  continuous  function  b  :  R  — > 
R,  such  that 


Xt  =  Bt+  [  b(Xs)  As. 

Jo 

1.  Let  F  :  R  — >  R  be  a  twice  continuously  differentiable  function  on  R.  Show 
that,  for  E(AQ  to  be  a  continuous  local  martingale,  it  suffices  that  F  satisfies  a 
second-order  differential  equation  to  be  determined. 

2.  Give  the  solution  of  this  differential  equation  which  is  such  that  E(0)  =  0  and 
F'  (0)  =  1.  In  what  follows,  F  stands  for  this  particular  solution,  which  can  be 
written  in  the  form  F(x)  =  J0  exp(— 2/3 (y))  dy,  with  a  function  /3  that  will  be 
determined  in  terms  of  b. 
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3.  In  this  question  only,  we  assume  that  b  is  integrable,  i.e.  fR  \b(x)\dx  <  oo. 

(a)  Show  that  the  continuous  local  martingale  Mt  —  F(Xt)  is  a  martingale. 

(b)  Show  that  (M,  M)^  —  oo  a.s. 

(c)  Infer  that 


limsupA^  =  +oo  ,  liminfW  =  — oo  ,  a.s. 

t—>o o  t—>oo 

4.  We  come  back  to  the  general  case.  Let  c  <  0  and  d  >  0,  and 

Tc  =  inf{f  >  0  :  Xt  <  c}  ,  Td  =  inf {t  >  0  :  Xt  >  d}  . 

Show  that,  on  the  event  {Tc  A  Tci  —  oo},  the  random  variables  \Bn+\  —  Bn |,  for 
integers  n  >  0,  are  bounded  above  by  a  (deterministic)  constant  which  does  not 
depend  on  n.  Infer  that  P(TC  A  Td  —  oo)  =  0. 

5.  Compute  P(TC  <  Td)  in  terms  of  F(c )  of  F(d). 

6.  We  assume  that  b  vanishes  on  (— oo,  0]  and  that  there  exists  a  constant  a  >  1/2 
such  that  b(x )  >  ot/x  for  every  jc  >  1 .  Show  that,  for  every  s  >  0,  one  can  choose 
c  <  0  such  that 


P(Tn  <  Tc,  for  every  n  >  1)  >  1  —  s. 

Infer  that  Xt  — >  +oo  as  t  — >  oo,  a.s.  (Hint:  Observe  that  the  continuous  local 
martingale  MtaTc  is  bounded.) 

7.  Suppose  now  b(x )  =  1/ (2x)  for  every  x>\.  Show  that 

liminf  Xt  —  — oo  ,  a.s. 

t—>OQ 

Exercise  5.30  (Levy  area)  Let  (Xt,  Yt)t> o  be  a  two-dimensional  (#))-Brownian 
motion  started  from  0.  We  set,  for  every  t  >  0  : 

A/t—  Xsd  Ys  —  /  LydX^  (Levy’s  area). 

Jo  Jo 

1.  Compute  (&/,  &/)t  and  infer  that  (srft)t>o  is  a  square-integrable  (true)  martingale. 

2.  Let  A  >  0.  Justify  the  equality 


E[eiXA  =  E[  cos(A^)]. 
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3.  Let  /  be  a  twice  continuously  differentiable  function  on  M+.  Give  the  canonical 
decomposition  of  the  semimartingales 


Zt  —  cos  (A  £%), 

W,  =  +  y2)  +  fit). 


Verify  that  (Z,W)t  =  0. 

4.  Show  that,  for  the  process  Zt  eWt  to  be  a  continuous  local  martingale,  it  suffices 
that  /  solves  the  differential  equation 


f'(t)=f(t)2-k2. 


5.  Let  r  >  0.  Verify  that  the  function 

fit)  =  -  log  cosh  (A  (r  - 1 )) 


solves  the  differential  equation  of  question  4.  and  derive  the  formula 


1 

cosh  (A  r) 


Exercise  5.31  ( Squared  Bessel  processes )  Let  B  be  an  (J^)- -Brownian  motion 
started  from  0,  and  let  X  be  a  continuous  semimartingale.  We  assume  that  X  takes 
values  in  M+,  and  is  such  that,  for  every  t  >  0, 


—  x  2 


f  Vx 

Jo 


dBs  +  a  t 


where  v  and  a  are  nonnegative  real  numbers. 

1.  Let /  :  M+  — >  M+  be  a  continuous  function,  and  let  <p  be  a  twice  continuously 
differentiable  function  on  M+,  taking  strictly  positive  values,  which  solves  the 
differential  equation 


cp"  =  2  ftp 

and  satisfies  99(0)  =  1  and  ^(l)  =  0.  Observe  that  the  function  99  must  then  be 
decreasing  over  the  interval  [0, 1]. 

We  set 


2  (pit) 
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for  every  t  >  0.  Verify  that  we  have,  for  every  t  >  0, 

i/(f)  +  2u(t)2  =f(t ), 


then  show  that,  for  every  t  >  0, 


w(y)  dVv  —  2  f  u(s)2Xs  dv 

Jo 


We  set 


/(s)  Xs  d.v. 


2.  Show  that,  for  every  t  >  0, 


cp(t)~a/2sY'  =S(N)t 

where  S(N),  =  exp (7Vf  —  l(N,N)t)  denotes  the  exponential  martingale  associ- 
ated  with  the  continuous  local  martingale 


Nt  —  u(0)x  +  2  f  u(s)  ^/x~sd Bs. 

Jo 


3.  Infer  from  the  previous  question  that 


CXP  (  _  /  f(s)Xsdsj  =  <p(  1)"/2  exp(^'(0)) 


4.  Let  A  >  0.  Show  that 


exp  ^  —  A  I  X5d^j  =  (cosh(\/2A))  01  ^2  exp(— -  V2A  tanh(V2A)). 


5.  Show  that,  if  P  —  (Pt)t> o  is  a  real  Brownian  motion  started  from  y,  one  has,  for 
every  A  >  0, 

exp  ^  —  A  J  P2  ds^j  =  (cosh(\/2A))_1/2  exp(-y  V2A  tanh(V2A)). 


Exercise  5.32  (Tanaka ’s  formula  and  local  time )  Let  B  be  an  (J^)-Brownian 
motion  started  from  0.  For  every  s  >  0,  we  define  a  function  g£  :  R  — >  R  by 
setting  ge(v)  =  \fs  +  x2. 
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1.  Show  that 


gs  (Bt)  —  g£  (0)  +  Met  +  A£t 


where  M£  is  a  square  integrable  continuous  martingale  that  will  be  identified  in 
the  form  of  a  stochastic  integral,  and  A£  is  an  increasing  process. 

2.  We  set  sgn(x)  =  l{x>o}  —  l{x<o}  f°r  every  xgI.  Show  that,  for  every  t  >  0, 


Infer  that  there  exists  an  increasing  process  L  such  that,  for  every  t  >  0, 


d Bs  +  Lt  . 


3.  Observing  that  A£  — >  Lt  when  e  — >  0,  show  that,  for  every  S  >  0,  for  every 
choice  of  0  <  u  <  v,  the  condition  (\Bt\  >  S  for  every  t  G  [u,v])  a.s.  implies 
that  Lv  =  Lu.  Infer  that  the  function  t  \->  Lt  is  a.s.  constant  on  every  connected 
component  of  the  open  set  {t  >  0  :  Bt  ^  0}. 

4.  We  set  f}t  —  sgn (Bs)  d Bs  for  every  t  >  0.  Show  that  (Pt)t>o  is  an  (J^)- 

Brownian  motion  started  from  0. 

5.  Show  that  Lt  —  sup  (—ft),  a.s.  (In  order  to  derive  the  bound  Lt  <  sup(— ^),  one 

,V</ 

may  consider  the  last  zero  of  B  before  time  t ,  and  use  question  3.)  Give  the  law 
of  Lt. 

6.  For  every  s  >  0,  we  define  two  sequences  of  stopping  times  (S£n)n>  i  and  (T£)n>  i, 
by  setting 


rf  =  inf{t  >  0  : 


and  then,  by  induction, 

SS-H  =  inf{?  >K:B,  =  0} ,  r„£+1  =  inf{r  >  5^+1  :  |B(|  =  e}. 

For  every  t  >  0,  we  set  N£  =  sup{n  >  1  :  T£  <  t},  where  sup  0  =  0.  Show  that 

sNE  Lt. 


(One  may  observe  that 


A  + 


sgn(Bs)dBs  =  sNE  +  r£ 


where  the  “remainder”  r£  satisfies  \r£\  <  s.) 
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7.  Show  that  Nj /  +Jt  converges  in  law  as  t  ->  oo  to  \U\,  where  U  is  ^(0,1)- 
distributed. 

(Many  results  of  Exercise  5.32  are  reproved  and  generalized  in  Chap.  8.) 

Exercise  5.33  (Study  of  multidimensional  Brownian  motion )  Let  Bt  — 
(Blt,  B2, . . . ,  7?f)  be  an  V-dimensional  (J^) -Brownian  motion  started  from 
v  =  (x\ , . . . ,  x^)  G  Rn.  We  suppose  that  N  >  2. 

1.  Verify  that  \Bt\2  is  a  continuous  semimartingale,  and  that  the  martingale  part  of 
\Bt\ 2  is  a  true  martingale. 

2.  We  set 


with  the  convention  that  ^  =  0  if  |7?s|  =  0.  Justify  the  definition  of  the 
stochastic  integrals  appearing  in  the  definition  of  fit,  then  show  that  the  process 
(Pi)t> o  is  an  (J^)-Brownian  motion  started  from  0. 

3.  Show  that 


Bt 


x 


f 


+  2  /  Bs  |  d/3s  +  N  t. 


4.  From  now  on,  we  assume  that  v  ^  0.  Let  e  G  (0,  \x\)  and  Ts  —  inf{t  >  0  :  \Bt\  < 
s}.  We  set  f(a)  —  log  a  if  N  =  2,  and  f(a)  —  a2~N  if  N  >  3,  for  every  a  >  0. 
Verify  that/(|Z^Are  |)  is  a  continuous  local  martingale. 

5.  Let  7?  >  |x|  and  Sr  =  inf {t  >  0  :  |7^|  >  7?}.  Show  that 


F(Le  <  S*)  = 


f(R)  ~/OI) 

/(*)  -/(£)  ‘ 


Observing  that  P(TS  <  Sr)  — >  0  when  e  ->  0,  show  that  Bt  ^  0  for  every  t  >  0, 
a.s. 

6.  Show  that,  a.s.,  for  every  t  >  0, 


Bt 


x 


+  fit  + 


N-  1 


/ 


ds 

Led 


7.  We  assume  that  N  >  3.  Show  that  \Bt\  — >  oo  when  t  ->  oo,  a.s.  (7/mt:  Observe 

that  is  a  nonnegative  supermartingale.) 

8.  We  assume  N  —  3.  Using  the  form  of  the  Gaussian  density,  verify  that  the 
collection  of  random  variables  (|7^|-1)r>0  is  bounded  in  L2 .  Show  that  (|7^|_1)r>o 
is  a  continuous  local  martingale  but  is  not  a  (true)  martingale. 
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0 Chapter  7  presents  a  slightly  different  approach  to  the  results  of  this  exercise,  see 
in  particular  Proposition  7.16.) 

Exercise  5.34  ( Application  of  the  Cameron-Martin  formula)  Let  B  be  an  (J^)- 
Brownian  motion  started  from  0.  We  set  B*  =  sup{|5s|  :  s  <  t}  for  every  t  >  0. 

1.  Set  U\  —  inf {t  >  0  :  \Bt\  =  1}  and  V\  —  inf{t  >  U\  :  Bt  —  0}.  Justify 
the  equality  P(By  <  2)  =  1/2,  and  then  show  that  one  can  find  two  constants 
a  >  0  and  y  >  0  such  that 


P(V i  >a9B*l<2)  =  y>  0. 

2.  Show  that,  for  every  integer  n  >  1,  P(B*a  <  2)  >  yn.  Hint :  Construct  a  suitable 
sequence  V\ ,  V2, . . .  of  stopping  times  such  that,  for  every  n  >  2, 


P(Vn  >  wa  ,^7  <  2)  >  yP(Vn-i  >(n-  l)ot  ,Byn_x  <  2). 

Conclude  that,  for  every  e  >  0  and  t  >  0,  ^(5*  <  ^)  >  0. 

3.  Let  h  be  a  twice  continuously  differentiable  function  on  M+  such  that  /z(0)  =  0, 
and  let  K  >  0.  Via  a  suitable  application  of  Ito’s  formula,  show  that  there  exists 
a  constant  A  such  that,  for  every  e  >  0, 


<  As 


a.s.  on  the  event  {Bf  <  s}. 


4.  We  set  Xt  —  Bt  —  h(t)  and  X*  =  sup{|X5|  :  s  <  t}.  Infer  from  question  3.  that 


lim  P(X\  1 

40  P(B*  <  e) 


=  exp 


( ■ -'2  f  ",,s>2  ds) 


Notes  and  Comments 

The  reader  who  wishes  to  learn  more  about  the  topics  of  this  chapter  is  strongly 
advised  to  look  at  the  excellent  books  by  Karatzas  and  Shreve  [49],  Revuz  and 
Yor  [70]  and  Rogers  and  Williams  [72].  A  more  concise  introduction  to  stochastic 
integration  can  also  be  found  in  Chung  and  Williams  [10]. 

Stochastic  integrals  with  respect  to  Brownian  motion  were  introduced  by  Ito  [36] 
in  1944.  His  motivation  was  to  give  a  rigorous  approach  to  the  stochastic  differential 
equations  that  govern  diffusion  processes.  Doob  [15]  suggested  to  study  stochastic 
integrals  as  martingales.  Several  authors  then  contributed  to  the  theory,  including 
Kunita  and  Watanabe  [50]  and  Meyer  [60].  We  have  chosen  to  restrict  our  attention 
to  stochastic  integration  with  respect  to  continuous  semimartingales.  The  reader 
interested  in  the  more  general  case  of  semimartingales  with  jumps  can  consult  the 
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treatise  of  Dellacherie  and  Meyer  [14]  and  the  more  recent  books  of  Protter  [63] 
or  Jacod  and  Shiryaev  [44].  Ito’s  formula  was  derived  in  [40]  for  processes  that  are 
stochastic  integrals  with  respect  to  Brownian  motions,  and  in  our  general  context  it 
appeared  in  the  work  of  Kunita  and  Watanabe  [50].  Theorem  5. 12,  at  least  in  the  case 
d  —  1 ,  is  usually  attributed  to  Levy,  although  it  seems  difficult  to  find  this  statement 
in  Levy’s  work  (see  however  [54,  Chapitre  III]).  Theorem  5.13  showing  that  any 
continuous  martingale  can  be  written  as  a  time-changed  Brownian  motion  is  due  to 
Dambis  [11]  and  Dubins-Schwarz  [17].  The  Burkholder-Davis-Gundy  inequalities 
appear  in  [7] ,  see  also  the  expository  article  of  Burkholder  [6]  for  the  history  of 
these  famous  inequalities.  Theorem  5.18  goes  back  to  Ito  [39]  -  in  the  different 
form  of  the  chaos  decomposition  of  Wiener  functionals  -  and  was  a  great  success  of 
the  theory  of  stochastic  integration.  This  theorem  and  its  numerous  extensions  have 
found  many  applications  in  the  area  of  mathematical  finance.  Girsanov’s  theorem 
appears  in  [29]  in  1960,  whereas  the  Cameron-Martin  formula  goes  back  to  [8]  in 
1944.  Applications  of  Girsanov’s  theorem  to  stochastic  differential  equations  are 
developed  in  the  book  [77]  of  Stroock  and  Varadhan.  Exercise  5.30  is  concerned 
with  the  so-called  Levy  area  of  planar  Brownian  motion,  which  was  studied  by 
Levy  [53,  54]  with  a  different  definition.  Exercise  5.31  is  inspired  by  Pitman  and 
Yor  [67]. 


Chapter  6 

General  Theory  of  Markov  Processes 


Our  goal  in  this  chapter  is  to  give  a  concise  introduction  to  the  main  ideas  of  the 
theory  of  continuous  time  Markov  processes.  Markov  processes  form  a  fundamental 
class  of  stochastic  processes,  with  many  applications  in  real  life  problems  outside 
mathematics.  The  reason  why  Markov  processes  are  so  important  comes  from  the 
so-called  Markov  property,  which  enables  many  explicit  calculations  that  would 
be  intractable  for  more  general  random  processes.  Although  the  theory  of  Markov 
processes  is  by  no  means  the  central  topic  of  this  book,  it  will  play  a  significant 
role  in  the  next  chapters,  in  particular  in  our  discussion  of  stochastic  differential 
equations.  In  fact  the  whole  invention  of  Ito’s  stochastic  calculus  was  motivated  by 
the  study  of  the  Markov  processes  obtained  as  solutions  of  stochastic  differential 
equations,  which  are  also  called  diffusion  processes. 

This  chapter  is  mostly  independent  of  the  previous  ones,  even  though  Brownian 
motion  is  used  as  a  basic  example,  and  the  martingale  theory  developed  in  Chap.  3 
plays  an  important  role.  After  a  section  dealing  with  the  general  definitions  and 
the  problem  of  existence,  we  focus  on  the  particular  case  of  Feller  processes, 
and  in  that  framework  we  introduce  the  key  notion  of  the  generator.  We  establish 
regularity  properties  of  Feller  processes  as  consequences  of  the  analogous  results 
for  supermartingales.  We  then  discuss  the  strong  Markov  property,  and  we  conclude 
the  chapter  by  presenting  three  important  classes  of  Markov  processes. 


6.1  General  Definitions  and  the  Problem  of  Existence 

Let  ( E ,  S)  be  a  measurable  space.  A  Markovian  transition  kernel  from  E  into  E  is  a 
mapping  Q  :  E  x  $  — >  [0, 1]  satisfying  the  following  two  properties: 

(i)  For  every  x  c  E,  the  mapping  $  3  A  i->  Q(x,  A)  is  a  probability  measure  on 
(E,  S). 

(ii)  For  every  A  £  $ ,  the  mapping  E  3  x  \->  Q(x,A )  is  ^-measurable. 
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In  what  follows  we  say  transition  kernel  instead  of  Markovian  transition  kernel. 

Remark  In  the  case  where  E  is  finite  or  countable  (and  equipped  with  the  a -field 
of  all  subsets  of  E ),  Q  is  characterized  by  the  “matrix”  (Q(x,  {y}))*,^. 

If  f  :  E  — >  R  is  bounded  and  measurable  (resp.  nonnegative  and  measurable), 
the  function  Qf  defined  by 


Qf(x)  =  J  Q(x,  dv)/(v) 

is  also  bounded  and  measurable  (resp.  nonnegative  and  measurable)  on  E.  Indeed, 
if  /  is  an  indicator  function,  the  measurability  of  Qf  is  just  property  (ii)  and  the 
general  case  follows  from  standard  arguments. 

Definition  6.1  A  collection  (Qt)t> o  of  transition  kernels  on  E  is  called  a  transition 
semigroup  if  the  following  three  properties  hold. 

(i)  For  every  x  e  E,  Q0(x,  d y)  =  8x(dy). 

(ii)  For  every  s,  t  >  0  and  A  e  S, 

Qt+s(x,A)  =  f  Q,(x,dy)Qs(y,A) 

JE 

(Chapman-Kolmogorov  identity). 

(iii)  For  every  A  e  S,  the  function  (t,x)  i->  Qt(x,A )  is  measurable  with  respect  to 
the  a-field  «2^(R+)  (8)  S. 

Let  B(E )  be  the  vector  space  of  all  bounded  measurable  real  functions  on  E, 
which  is  equipped  with  the  norm  ||/||  =  sup{[/(x)|  :  x  e  E}.  Then  the  linear 
mapping  B(E )  3  f  i->  Qtf  is  a  contraction  of  B(E).  From  this  point  of  view,  the 
Chapman-Kolmogorov  identity  is  equivalent  to  the  relation 

Qt+s  —  QtQs 

for  every  s,t>  0.  This  allows  one  to  view  (Qt)t> o  as  a  semigroup  of  contractions  of 
B(E). 

We  now  consider  a  filtered  probability  space  (X2,  o,oo] ,  P)- 

Definition  6.2  Let  (Qt)t> o  be  a  transition  semigroup  on  E.  A  Markov  process  (with 
respect  to  the  filtration  (J^))  with  transition  semigroup  (Qt)t> o  is  an  (APt) -adapted 
process  (W)r>o  with  values  in  E  such  that,  for  every  s,t  >  0  and /  e  B(E ), 

E\f(Xs+t)  |  &s]  =  Qf(Xs). 

Remark  When  we  speak  about  a  Markov  process  X  without  specifying  the 
filtration,  we  implicitly  mean  that  the  property  of  the  definition  holds  with  the 
canonical  filtration  —  cr(Xr,0  <  r  <  t).  We  may  also  notice  that,  if  X  is  a 
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Markov  process  with  respect  to  a  filtration  (J^),  it  is  automatically  also  a  Markov 
process  (with  the  same  semigroup)  with  respect  to  ( . 

The  definition  of  a  Markov  process  can  be  interpreted  as  follows.  Taking/  =  1a, 
we  have 


P[Xs+t  e  A 


Qt(Xs,A ) 


and  in  particular 


P[Xs+t  e  A 


Xr,0<r<s\  =  Qt(Xs,A). 


Hence  the  conditional  distribution  of  Xs+t  knowing  the  “past”  ( Xr ,  0  <  r  <  s) 
before  time  s  is  given  by  Qt(Xs,  •),  and  this  conditional  distribution  only  depends 
on  the  “present”  state  Xs.  This  is  the  Markov  property  (informally,  if  one  wants  to 
predict  the  future  after  time  s,  the  past  up  to  time  s  does  not  give  more  information 
than  just  the  present  at  time  s). 


Consequences  of  the  definition  Let  y( dx)  be  the  law  of  Xq.  Then  if  0  <  t\  <  h  < 
•  •  •  <  tp  and  Ao ,  A\ , . . . ,  Ap  £  , 

P(Xq  g  Ao,Xtl  £  A\ ,  Xt2  £  A2, . . . ,  Xtp  £  Ap) 


=  y(dx 0)  /  Qh{x 0,6x1)  Qt2-ti(x i,dx2)---/  Qtp-tp_x(xp- i,dxp), 

V  ,Aq  V  A  I  V  A2  «/  iT  n 


More  generally,  if/o,/i  ,...,fpe  B(E), 


E[fo(X0)fi(Xtl)  ■  ■ -fP(Xtp)]  =  J Y (dxo)/o (xo) J Qtl  (xo ,  d*i )/i (-^l ) 

x  J Qt2-t,  (xi ,  d x2)f2(x2)  ■  ■  J Q, ,  (xp- 1 ,  dxp)fp(xi 


) 


This  last  formula  is  derived  from  the  definition  by  induction  on  p.  Note  that, 
conversely,  if  the  latter  formula  holds  for  any  choice  of  0  <  t\  <  1 2  <  •  •  •  <  tp 
and/0,/1, . . .  ,fp  £  B(E ),  then  (Xt)t>0  is  a  Markov  process  of  semigroup  (Qt)t> 0, 
with  respect  to  its  canonical  filtration  & f  —  cr(Xr,0  <  r  <  t)  (use  a  monotone 
class  argument  to  see  that  the  property  of  the  definition  holds  with  —  APf ,  see 
Appendix  Al). 

From  the  preceding  formulas,  we  see  that  the  finite-dimensional  marginals  of  the 
process  X  are  completely  determined  by  the  semigroup  (Qt)t> 0  and  the  law  of  X0 
(initial  distribution). 

Example  If  E  —  MJ,  we  can  take,  for  every  t  >  0  and  x  eWl, 


Qt(x,  d y)  =  ptiy  -  x)  d y 
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where,  for  z  £ 


Mz)  =  (2^0  J/2 


is  the  density  of  the  Gaussian  vector  in  Rd  with  covariance  matrix  tl d.  It  is 
straightforward  to  verify  that  this  defines  a  transition  semigroup  on  Rd,  and  the 
associated  Markov  process  is  d-dimensional  Brownian  motion  (it  would  be  more 
accurate  to  say  pre-Brownian  motion  since  we  have  not  yet  said  anything  about 
sample  paths).  In  the  case  d  —  1,  compare  with  Corollary  2.4. 

We  now  address  the  problem  of  the  existence  of  a  Markov  process  with  a  given 
semigroup.  To  this  end,  we  will  need  a  general  theorem  of  construction  of  random 
processes,  namely  the  Kolmogorov  extension  theorem.  We  give  without  proof  the 
special  case  of  this  theorem  that  is  of  interest  to  us  (a  proof  in  a  more  general  setting 
can  be  found  in  [64,  Chapter  III],  see  also  [47,  Chapter  VII],  and  [49,  Section  2.2] 
for  the  special  case  E  =  R). 

Let  X2*  =  Er+  be  the  space  of  all  mappings  co  :  M+  — >  E.  We  equip  X2*  with 
the  a-field  generated  by  the  coordinate  mappings  co  i->  oo(t)  for  t  G  R+.  Let 
F(R+)  be  the  collection  of  all  finite  subsets  of  R+,  and,  for  every  U  G  F(R+),  let 
7 Tu  \  £2*  — >  Eu  be  the  mapping  which  associates  with  every  co  :  M+  — >  E  its 
restriction  to  U.  If  U,  V  G  F(R+)  and  U  C  V,  we  similarly  write  :  Ev  — >  Eu 
for  the  obvious  restriction  mapping. 

We  recall  that  a  topological  space  is  Polish  if  its  topology  is  separable  (there 
exists  a  dense  sequence)  and  can  be  defined  by  a  complete  metric. 

Theorem  6.3  Assume  that  E  is  a  Polish  space  equipped  with  its  Bore l  o -field  S. 
For  every  U  G  F(R+),  let  pu  be  a  probability  measure  on  Eu .  Assume  that  the 
collection  (pu,  U  G  F(R+))  is  consistent  in  the  following  sense:  If  U  C  V,  gtjj 
is  the  image  of  pty  under  n Then  there  exists  a  unique  probability  measure  fi  on 
(X2*,  J^*)  such  that  Ttuifi)  =  Pufor  every  U  G  F(R+). 


Remark  The  uniqueness  of  p  is  an  immediate  consequence  of  the  monotone  class 
lemma  (cf.  Appendix  Al). 

The  Kolmogorov  extension  theorem  allows  one  to  construct  random  processes 
having  prescribed  finite-dimensional  marginals.  To  see  this,  let  (Xt)t>o  be  the 
canonical  process  on  £2*  : 


Xt(co)  —  oof),  t  >  0. 

If  p  is  a  probability  measure  on  X2*  and  U  —  {t\, ...  ,tp)  G  F(R+),  with  t\  < 
t2  <  •  •  •  <  tp,  then  (Xtl , ...  ,Xtp)  can  be  viewed  as  a  random  variable  with  values 
in  Eu ,  provided  we  identify  Eu  with  Ep  via  the  mapping  co  — >  fo(t\), . . . ,  co{tp)). 
Furthermore,  the  distribution  of  (Xt] , . . . ,  Xtp)  under  p  is  iiufp ).  The  Kolmogorov 
theorem  can  thus  be  rephrased  by  saying  that  given  a  collection  {pv,  U  g  F(R+))  of 
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finite-dimensional  marginal  distributions,  which  satisfies  the  consistency  condition 
(this  condition  is  clearly  necessary  for  the  desired  conclusion),  one  can  construct 
a  probability  measure  ji  on  the  space  X2*,  under  which  the  finite- dimensional 
marginals  of  the  canonical  process  X  are  the  measures  [ijj ,  U  e  F(M+). 

Corollary  6.4  We  assume  that  E  satisfies  the  assumption  of  the  previous  theorem 
and  that  (Qt)t> o  is  a  transition  semigroup  on  E.  Eet  y  be  a  probability  measure 
on  E.  Then  there  exists  a  ( unique )  probability  measure  P  on  T2*  under  which  the 
canonical  process  (Xt)t>o  is  a  Markov  process  with  transition  semigroup  (Qt)t> o 
and  the  law  ofX o  is  y. 

Proof  Let  U  —  {t\,...,tp}  E  F(R+),  with  0  <  t\  <  •••  <  tp.  We  define  a 
probability  measure  Pu  on  Eu  (identified  with  Ep  as  explained  above)  by  setting 


P  (dxi  . . .  d in)  1a(*i  ,  •  •  • ,  Xp) 


f 

=  J y(dxo) JQ, ,  (-to ,  dxi) I Qt2-h  (ai  ,  d x2)  ■  ■  -J Q, (xp- 1 ,  dxp) \A(xi , 


for  any  measurable  subset  A  of  Eu . 

Using  the  Chapman-Kolmogorov  relation,  one  verifies  that  the  measures  Pu 
satisfy  the  consistency  condition.  The  Kolmogorov  theorem  then  gives  the  existence 
(and  uniqueness)  of  a  probability  measure  P  on  12*  whose  finite-dimensional 
marginals  are  the  measures  Pu ,  U  E  F(R+).  By  a  previous  observation,  this  implies 
that  pQ)^>o  is  under  P  a  Markov  process  with  semigroup  (Qt)t>o,  with  respect  to  the 
canonical  filtration.  □ 

For  x  e  E,  let  Px  be  the  measure  given  by  the  preceding  corollary  when  y  —  8X. 
Then,  the  mapping  v  i->  Px  is  measurable  in  the  sense  that  v  i-^  PX(A)  is  measurable, 
for  every  A  e  AP* .  In  fact,  the  latter  property  holds  when  A  only  depends  on  a  finite 
number  of  coordinates  (in  that  case,  there  is  an  explicit  formula  for  PX(A))  and 
a  monotone  class  argument  gives  the  general  case.  Moreover,  for  any  probability 
measure  y  on  E ,  the  measure  defined  by 

P(yM)  =  j  y(dx)Px(A) 

is  the  unique  probability  measure  on  X2*  under  which  the  canonical  process  (W)^>o 
is  a  Markov  process  with  semigroup  (Qt)t> o  and  the  law  of  Xo  is  y. 

Summarizing,  the  preceding  corollary  allows  one  to  construct  (under  a  topologi¬ 
cal  assumption  on  E)  a  Markov  process  (W)^>o  with  semigroup  (Qt)t> o,  which  starts 
with  a  given  initial  distribution.  More  precisely,  we  get  a  measurable  collection  of 
probability  measures  ( Px)xeE  such  that  the  Markov  process  X  starts  from  v  under 
Px.  However,  a  drawback  of  the  method  that  we  used  is  the  fact  that  it  does  not 
give  any  information  on  the  regularity  properties  of  sample  paths  of  X  -  at  present 
we  cannot  even  assert  that  these  sample  paths  are  measurable.  We  will  come  back 
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to  this  question  later,  but  this  will  require  additional  assumptions  on  the  semigroup 
(Qt)t>  o- 

We  end  this  section  by  introducing  the  important  notion  of  the  resolvent. 

Definition  6.5  Let  A  >  0.  The  A -resolvent  of  the  transition  semigroup  (Qt)t> o  is 
the  linear  operator  Rx  :  B(E )  — >  B{E )  defined  by 

poo 

R\f(x)  =  /  e_A,g,/(x)  dt 

Jo 


for /  e  B{E)  and  x  e  E. 

Remark  Property  (iii)  of  the  definition  of  a  transition  semigroup  is  used  here  to  get 
the  measurability  of  the  mapping  t  i->  Qtf(x ),  which  is  required  to  make  sense  of 
the  definition  of  Rxf(x). 

Properties  of  the  resolvent. 

(i)  \\Rxf\\  <  ill/ll- 

(ii)  If  0  <  /  <  1 ,  then  0  <  A Rxf  <  1 . 

(iii)  If  A ,  /x  >  0,  we  have  Rx  —  R^  +  (A  —  fEjRxR^  —  0  ( resolvent  equation). 

Proof  Properties  (i)  and  (ii)  are  very  easy.  Let  us  only  prove  (iii).  We  may  assume 
that  A  ^  /x.  Then, 


Rx(R,J)(x) 


giving  the  desired  result. 


□ 


6. 1  General  Definitions  and  the  Problem  of  Existence 
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Example  In  the  case  of  real  Brownian  motion,  one  verifies  that 

R}f(x)  =  f  n(y-  x)f(y )  dv 


where 


Xt)  dt  —  _ exp(—  \y  —  x\  V2X) . 

v  2A 


A  neat  way  of  getting  the  last  equality  is  to  use  the  formula  E[o~XTa]  —  for 

the  Laplace  transform  of  the  hitting  time  a  >  0  by  a  real  Brownian  motion  started 
from  0  (see  formula  (3.7)).  By  differentiating  with  respect  to  A,  we  get  E[Ta  o~XTa]  — 

(a I  \/7X)zra'^x  and  using  the  density  of  Ta  (Corollary  2.22)  to  rewrite  E[Ta  e~XTa], 
we  exactly  find  the  integral  that  comes  up  in  the  calculation  of  r^iy  —  v). 

A  key  motivation  of  the  introduction  of  the  resolvent  is  the  fact  that  it  allows  one 
to  construct  certain  supermartingales  associated  with  a  Markov  process. 

Lemma  6.6  Let  X  be  a  Markov  process  with  semigroup  (Qt)t> o  with  respect  to  the 
filtration  (J^i).  Let  h  G  B(E )  be  nonnegative  and  let  A  >  0.  Then  the  process 


e-x,Rxh(Xt) 


is  an  (J^)- supermartingale. 

Proof  The  random  variables  e~XtR\h(Xt)  are  bounded  and  thus  in  L1.  Then,  for 
every  s  >  0, 


/’OO 

Jo 


QsR\h  —  I  e  XrQs+,hdt 


and  it  follows  that 


n  OO  /» OO 

e~XsQsRxh=  /  e~X(s+t)Qs+,hdt  =  /  e~Xt Qth  At  <  Rxh 

Jo  Js 


Hence,  for  every  S,t>  0, 


E[e-X(,+s)Rxh(Xt+s)  |  J?,]  =  e-X{,+s)QsRxh(Xt )  <  e-x,Rxh(Xt), 


giving  the  desired  supermartingale  property. 


□ 
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6.2  Feller  Semigroups 


From  now  on,  we  assume  that  E  is  a  metrizable  locally  compact  topological  space. 
We  also  assume  that  E  is  countable  at  infinity,  meaning  that  E  is  a  countable  union 
of  compact  sets.  The  space  E  is  equipped  with  its  Borel  a-field. 

Under  the  previous  assumptions,  it  is  well  known  that  the  space  E  is  Polish. 
Moreover,  one  can  find  an  increasing  sequence  (Kn)n>\  of  compact  subsets  of  E , 
such  that  any  compact  set  of  E  is  contained  in  Kn  for  some  n.  A  function/  :  E  — >  R 
tends  to  0  at  infinity  if,  for  every  e  >  0,  there  exists  a  compact  subset  K  of  E  such 
that  |/(x)|  <  s  for  every  v  £  E\K.  This  is  equivalent  to  requiring  that 

sup  \f(x)\  — »  0. 

x€E\K„  n^°° 

We  let  Co(E)  stand  for  the  set  of  all  continuous  real  functions  on  E  that  tend  to  0 
at  infinity.  The  space  C$(E)  is  a  Banach  space  for  the  supremum  norm 


/II  =  sup|/(x)|. 

x£E 


Definition  6.7  Let  (Qt)t> o  be  a  transition  semigroup  on  E.  We  say  that  (Qt)t> o  is  a 
Feller  semigroup  if: 

(i)  V/  £  C0(E),  Qtf  £  C0(£); 

(ii)  V/  £  C0(E),  112/  -/||  — ►  0  as  t  ->  0. 

A  Markov  process  with  values  in  E  is  a  Feller  process  if  its  semigroup  if  Feller. 

Remark  One  can  prove  (see  for  instance  [70,  Proposition  III. 2.4])  that  condition 
(ii)  can  be  replaced  by  the  seemingly  weaker  property 

V/  £  C0(E),  V*  £  E,  0/W  —►/(*). 

t->  0 

We  will  not  use  this,  except  in  one  particular  example  at  the  end  of  this  chapter. 
Condition  (ii)  implies  that,  for  every  s  >  0, 

lim  \\Qs+tf  ~  QsfW  =  lim  WQAQtf  ~f)\\  =  0 

40  40 

since  Qs  is  a  contraction  of  Cq{E).  We  note  that  the  convergence  is  uniform  when  s 
varies  over  M+ ,  which  ensures  that  the  mapping  t  i-^  2/  is  uniformly  continuous 
from  M+  into  Co(E ),  for  any  fixed  /  £  Co(E). 

In  what  follows,  we  fix  a  Feller  semigroup  (Qt)t> o  on  E.  Using  property  (i)  of 
the  definition  and  the  dominated  convergence  theorem,  one  easily  verifies  that  R\f  £ 
Cq (E)  for  every  /  £  Cq(E)  and  A  >  0. 
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Proposition  6.8  Let  A  >  0,  and  set  &  —  {R\f  :  /  e  C${E)}.  Then  &  does  not 
depend  on  the  choice  A  >  0.  Furthermore,  &  is  a  dense  subspace  ofCo(E). 

Proof  If  A  ^  //,  the  resolvent  equation  gives 

Rxf  =  Rv{f  +  <ji-X)Rxf)- 

Hence  any  function  of  the  form  R\f  with /  e  Cq(E)  is  also  of  the  form  R^g  for  some 
g  e  Co(E).  This  gives  the  first  assertion. 

Clearly,  8%  is  a  linear  subspace  of  Co(E).  To  see  that  it  is  dense,  we  simply  note 
that,  for  every  /  e  Cq(E), 

n  OC  n  OO 

A Rxf  =  A  /  e-x,Qtfdt  =  /  e~'Qt/xfdt  — ►  /  in  C0(E), 

Jo  Jo  A->oo 

by  property  (ii)  of  the  definition  of  a  Feller  semigroup  and  dominated  convergence. 

□ 


Definition  6.9  We  set 

D(L)  =  {f  e  Co(E )  :  — - — —  converges  in  Co(E )  when  1 1  0} 

t 

and,  for  every/  e  D(L), 


Lf  =  lim 

40 


Qtf-f 

t 


Then  D(L )  is  a  linear  subspace  of  Co(E )  and  L  :  D(L)  — >  Co(E )  is  a  linear  operator 
called  the  generator  of  the  semigroup  (Qt)t> o-  The  subspace  D(L)  is  called  the 
domain  of  L. 

Let  us  start  with  two  simple  properties  of  the  generator. 

Proposition  6.10  Letf  e  D(L)  and  s  >  0.  Then  QJ  e  D(L )  andL(Qsf)  —  Qs(Lf). 
Proof  Writing 


Qtm)-  QJ_  _  n  (Qtf-f 

t  ^  V  t 

and  using  the  fact  that  Qs  is  a  contraction  of  Co(E ),  we  get  that  t~l  ( QtiQsf )  —  QJ) 
converges  to  Qs{Lf ),  which  gives  the  desired  result.  □ 

Proposition  6.11  Iff  e  D{L ),  we  have,  for  every  t  >  0, 

d  s=f  +  f  L(Qsf)  ds. 

Jo 


Qtf  =f  +  /  Qs(Lf) 


l 
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Proof  Let /  e  D(L).  For  every  t  >  0, 

s~\Qt+sf  -  Q,f)  =  ))  —►  Q,(Lf). 

e|0 


Moreover,  the  preceding  convergence  is  uniform  when  t  varies  over  R+.  This 
implies  that,  for  every  x  e  E,  the  function  t  i->  Qtf(x )  is  differentiable  on  R+ 
and  its  derivative  is  Qt(Lf)(x ),  which  is  a  continuous  function  of  t.  The  formula  of 
the  proposition  follows,  also  using  the  preceding  proposition.  □ 

The  next  proposition  identifies  the  domain  D(L)  in  terms  of  the  resolvent 
operators  Rx . 

Proposition  6.12  Let  A  >  0. 

(i)  For  every  g  e  C0(E),  Rxg  e  D{L)  and  (A  -  L)Rxg  =  g. 

(ii)  Iff  eD(L),  Rx(X-L)f=f. 

Consequently,  D{L)  —  8%  and  the  operators  Rx  :  Co(E )  ->  8%  and  A  —  L  :  D(L )  -> 
Co(E )  are  the  inverse  of  each  other. 

Proof 

(i)  If  g  e  Co(E ),  we  have  for  every  e  >  0, 


e  l(QsR\g-R\g) 


e  AfG£+/gdf- 


— >  -  g 

8^0 


using  property  (ii)  of  the  definition  of  a  Feller  semigroup  (and  the  fact  that  this 
property  implies  the  continuity  of  the  mapping  t  i->  Qtg  from  M+  into  Co(£)). 
The  preceding  calculation  shows  that  Rxg  e  D(L )  and  L(Rxg)  —  A Rxg  —  g. 

(ii)  Let /  e  D{L).  By  Proposition  6.1 1,  Qf  —  f  +  Jq  Qs(Lf )  ds,  hence 


jf  e"AW(*)  dr  = 

=  /w 

A 


+  j  e  f  Qs(Lf)(x)  ds^dt 


+ 


/ 


OO  Q—ks 


A 


Qs(Lf)(x)  d.v. 


We  have  thus  obtained  the  equality 


A  Rxf=f  +  RxLf 


giving  the  result  in  (ii). 
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The  last  assertions  of  the  proposition  follow  from  (i)  and  (ii):  (i)  shows  that  8%  C 
D(L)  and  (ii)  gives  the  reverse  inclusion,  then  the  identities  in  (i)  and  (ii)  show  that 
Rx  and  A  —  L  are  inverse  of  each  other.  □ 

Corollary  6.13  The  semigroup  (Qt)t> o  is  determined  by  the  generator  L  ( including 
also  the  domain  D(L)). 

Proof  Let  /  be  a  nonnegative  function  in  Co(E).  Then  Ryf  is  the  unique  element 
of  D(L )  such  that  (A  —  L)R\f  —  f.  On  the  other  hand,  knowing  Ryf(x )  = 
/0°°  e~XtQtf(x)dt  for  every  A  >  0  determines  the  continuous  function  t  >->  Qtf(x). 
To  complete  the  argument,  note  that  Qt  is  characterized  by  the  values  of  Qjf  for 
every  nonnegative  function/  in  Co  (E) .  □ 

Example  It  is  easy  to  verify  that  the  semigroup  (Qt)t> o  of  real  Brownian  motion  is 
Feller.  Let  us  compute  its  generator  L.  We  saw  that,  for  every  A  >  0  and  /  G  Co(R), 

=  /  7B 

If  h  G  D(L ),  we  know  that  there  exists  an  /  G  Co(R)  such  that  h  —  Ryf.  Taking 
A  =  j ,  we  have 


exp(— V2A|j  -  x\ )f(y)  dj. 


h(x) 


-f 


exp(— |y  —  x\)f(y)  dy. 


By  differentiating  under  the  integral  sign  (we  leave  the  justification  to  the  reader), 
we  get  that  h  is  differentiable  on  R,  and 


l(x)  =  j 


h'(x)  =  /  sgn(y  — x)  exp(— |j  —  x|)/(j)  d^ 


with  the  notation  sgn(z)  =  1{Z>0}  —  l{z<o}  (the  value  of  sgn(O)  is  unimportant).  Let 
us  also  show  that  h '  is  differentiable  on  R.  Let  xo  e  R.  Then,  for  v  >  xo, 

ti(x)  -  h'(x0)= J  (^sgn(j  -  x)  exp(— |y  -  x\)  -  sgn(j  -  x0)  exp(-|y  -  x0 
=  f  (  -  exp(— |v  —  x|  )  -  exp(—  |v  —  x0 1 ) \f  ( v)  dv 

JX{)  V  7 

+  /  sgnCy-x0)(  exp(— |y  — x|)  -  exp(-\y  -  x0\))f  (y)  dy . 

«/R\[xo,x]  V  7 


It  follows  that 


h'(x 0) 

- > 


— 2/(x0)  +  h(x0). 


x  —  xo 
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We  get  the  same  limit  when  x  \  xo,  and  we  thus  obtain  that  h  is  twice  differentiable, 
and  h "  —  —2 f  +  h. 

On  the  other  hand,  since  h  —  R\/yf,  Proposition  6.12  shows  that 

(l--L)h=f 

hence  Lh  —  —f  +  =  \h!' . 

Summarizing,  we  have  obtained  that 

D(L)  c  {he  C2(M)  :  h  and  h "  e  C0(M)} 

and  that,  if  h  e  D(L),  we  have  Lh  —  \ h 

In  fact,  the  preceding  inclusion  is  an  equality.  To  see  this,  we  may  argue  in  the 
following  way.  If  g  is  a  twice  differentiable  function  such  that  g  and  g "  are  in  Cb(M), 
we  set /  =  \{g  —  gff)  e  Co(M),  then  h  —  R\/zf  e  D(L).  By  the  preceding  argument, 
h  is  twice  differentiable  and  h "  —  —2 f+h.  It  follows  that  (h—g)"  —  h—g.  Since  the 
function  h—g  belongs  to  Cb(M),  it  must  vanish  identically  and  we  get  g  —  h  e  D(L). 

Remark  In  general,  it  is  very  difficult  to  determine  the  exact  domain  of  the 
generator.  The  following  theorem  often  allows  one  to  identify  elements  of  this 
domain  using  martingales  associated  with  the  Markov  process  with  semigroup 
(Qt)t>  o- 

We  consider  again  a  general  Feller  semigroup  (Qt)t>o-  We  assume  that  on  some 
probability  space,  we  are  given,  for  every  v  e  E,  a  process  (Xf)t>o  which  is  Markov 
with  semigroup  (Qt)t>o,  with  respect  to  a  filtration  o,  and  such  that  P(X J  = 

v)  =  1 .  To  make  sense  of  the  integrals  that  will  appear  below,  we  also  assume  that 
the  sample  paths  of  (Xf)t>o  are  cadlag  (we  will  see  in  the  next  section  that  this 
assumption  is  not  restrictive). 

Theorem  6.14  Let  h,  g  e  Co(E).  The  following  two  conditions  are  equivalent: 

(i)  h  e  D(L)  and  Lh  —  g. 

(ii)  For  every  x  G  E,  the  process 


KXf)-  fg(X*)  ds 
Jo 

is  a  martingale,  with  respect  to  the  filtration  (#?). 

Proof  We  first  prove  that  (i)  =>  (ii).  Let  h  e  D(L )  and  g  —  Lh.  By  Proposition  6.1 1, 
we  have  then,  for  every  s  >  0, 


Qsh  =  h  +  f  Qrg  dr. 
Jo 
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It  follows  that,  for  t  >  0  and  s  >  0, 


E[h(XUs)  I  &t\  =  Qsh(X*)  =  h(Xf)  +  [  Qrg(Xx)  dr 

Jo 


On  the  other  hand, 


U 


t-\-s 


g(K)  dr 


■9 


/t-\-s  rt-\-s 

E[g(K)  I  &t]  d r  =  J  Qr-tg(Xf)  dr 


-l 


Qrg(K)  dr. 


The  fact  that  the  conditional  expectation  and  the  integral  can  be  interchanged  (in  the 
first  equality  of  the  last  display)  is  easy  to  justify  using  the  characteristic  property 
of  conditional  expectations.  It  follows  from  the  last  two  displays  that 


* t+s 


HXxt+s)  -  [  gpQdr 
Jo 


=  h(Xf)  -  [  g(X?)dr 
Jo 


giving  property  (ii). 

Conversely,  suppose  that  (ii)  holds.  Then,  for  every  t  >  0, 


h(Xx)  -  f  g(Xxr)dr\=h(x) 
Jo 


and  on  the  other  hand,  from  the  definition  of  a  Markov  process, 


hOC,)  -  f  g(X*)dr  =  Qth(x)  —  f  Qrg(x)dr. 
Jo  J  Jo 


Consequently, 


Qth  —  h  1 


=  7  f  Qrgdr  g 

t  J o  40 


in  Cq(E),  by  property  (ii)  of  the  definition  of  a  Feller  semigroup.  We  conclude  that 
h  G  D(L )  and  Lh  =  g.  □ 

Example  In  the  case  of  4-dimensional  Brownian  motion,  Ito’s  formula  shows  that, 
ifh  e  C2(Rd), 


MX,)  -l-f  Ah(Xs)  d s 
j  Jo 

is  a  continuous  local  martingale.  This  continuous  local  martingale  is  a  martingale  if 
we  furthermore  assume  that  h  and  Ah  are  in  Co(Wl )  (hence  bounded).  It  then  follows 
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from  Theorem  6.14  that  h  e  D(L)  and  Lh  =  \Ah.  Recall  that  we  already  obtained 
this  result  by  a  direct  computation  of  L  when  d  —  1  (in  fact  in  a  more  precise  form 
since  here  we  can  only  assert  that  D(L )  D  {h  e  C2  (MJ)  :  h  and  Ah  e  Co(MJ)}, 
whereas  equality  holds  if  d  —  1). 


6.3  The  Regularity  of  Sample  Paths 

Our  aim  in  this  section  is  to  show  that  one  construct  Feller  processes  in  such  a  way 
that  they  have  cadlag  sample  paths.  We  consider  a  Feller  semigroup  (Qt)t> o  on  a 
topological  space  E  (assumed  to  be  metrizable,  locally  compact  and  countable  at 
infinity  as  above). 

Theorem  6.15  Let  ( Xt)t>o  he  a  Markov  process  with  semigroup  (Qt)t> o>  w7/7z 
respect  to  the  filtration  o,oo]>  Set  =  J^oo  and,  for  every  t  >  0, 


&t  =  & t+  v  o^(yK), 


where  JY  denotes  the  class  of  all  -measurable  sets  that  have  zero  probability. 

Then,  the  process  (Xt)t>o  has  a  cadlag  modification  (Xt)t>o,  which  is  adapted  to 
the  filtration  (J^r).  Moreover,  (Xt)t> o  is  a  Markov  process  with  semigroup  (Qt)t> o> 
with  respect  to  the  filtration  o,oo]- 

Remark  The  filtration  (J^t)  is  right-continuous  because  so  is  the  filtration  (dEt+) 
and  the  right-continuity  property  is  preserved  when  adding  the  class  of  negligible 
sets  JT . 

Proof  Let  Ea  —  E  U  {Z\}  be  the  Alexandroff  compactification  of  E ,  which  is 
obtained  by  adding  the  point  at  infinity  A  to  E  (and  by  definition  the  neighborhoods 
of  A  are  the  complements  of  compact  subsets  of  E).  We  agree  that  every  function 
/  e  Co(E)  is  extended  to  a  continuous  function  on  E&  by  setting /(A)  =  0. 

Write  C q" (E)  for  the  set  of  all  nonnegative  functions  in  Co(E).  We  can  find  a 
sequence  (fn)n> o  in  Cq  (E)  which  separates  the  points  of  E a,  in  the  sense  that,  for 
every  x,y  e  E a  with  v  ^  y,  there  is  an  integer  n  such  that  fn(x)  ^  fn(y)-  Then, 

=  {Rpfn  :p>  \,n  >  0} 

is  also  a  countable  subset  of  C q"  ( E )  which  separates  the  points  of  Ea  (use  the  fact 
that  || pRpf  —f  ||  — >  0  when p  — >  oo). 

If  he  Jjf,  Lemma  6.6  shows  that  there  exists  an  integer  p  >  1  such  that  t~pth{Xt) 
is  a  supermartingale.  Let  D  be  a  countable  dense  subset  of  M+.  Then  Theorem  3.17 
(i)  shows  that  the  limits 


lim  h(Xs)  , 

D3S-\r \t 


lim  h{Xs) 

DDs'l'lt 
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exist  simultaneously  for  every  t  G  M+  (the  second  one  only  for  t  >  0)  outside 
an  -measurable  event  Njt  of  zero  probability.  Indeed,  as  in  the  proof  of 
Theorem  3.17,  we  may  define  the  complementary  event  Nch  as  the  set  of  all  co  g  Q 
for  which  the  function  D  3  s  i->  o~psh{Xs)  makes  a  finite  number  of  upcrossings 
along  any  interval  [a,  b\  (a,  b  G  Q,  a  <  b)  on  every  finite  time  interval.  We  then  set 

N=  (J  Nh 

in  such  a  way  that  we  still  have  N  G  JX .  Then  if  co  £  N,  the  limits 

lim  Xs(co)  ,  lim  Xs(co) 

exist  for  every  t  >  0  (the  second  one  only  for  t  >  0)  in  Ea-  In  fact,  if  we  assume 
that  Xs(co)  has  two  distinct  accumulation  points  in  Ea  when  D  3  s  t,  we  get  a 
contradiction  by  considering  a  function  h  e  3^  that  separates  these  two  points.  We 
can  then  set,  for  every  co  £  £2\N  and  every  t  >  0, 

Xt(co)  =  lim  Xs(co). 

D3S-\r-\rt 

If  co  e  N,  we  set  Xt(co)  =  vo  for  every  t  >  0,  where  Vo  is  a  fixed  point  in  E. 
Then,  for  every  t  >  0,  Xt  is  an  ^-measurable  random  variable  with  values  in  Ea  . 
Furthermore,  for  every  co  e  £2,  t  Xt(co ),  viewed  as  a  mapping  with  values  in  Ea , 
is  cadlag  by  Lemma  3.16  (this  lemma  shows  that  the  functions  t  i->  h(Xt(co )),  for 
h  e  Jtf,  are  cadlag,  and  this  suffices  since  Jt?  separates  points  of  E). 

Let  us  now  show  that  P{Xt  —  Xt)  =  1,  for  every  fixed  t  >  0.  Let/,  g  G  Cq(E) 
and  let  (L)  be  a  sequence  in  D  that  decreases  (strictly)  to  t.  Then, 

E\f(X,)g(X,)\  =  lim  E\f(Xt)g(X,J] 

n— >oo 

=  lim  E\f  {Xt)Qtn-,g(Xt)} 

n— >oo 

-  E\f(X,)g(Xt)] 

since  Qtn-tg  — >  g  by  the  definition  of  a  Feller  semigroup.  The  preceding  equality 
entails  that  the  two  pairs  (Xt,Xt)  and  (Xt,Xt)  have  the  same  distribution  and  thus 
P(Xt=Xt)  =  1. 

Let  us  then  verify  tha/(X?)?>o  is  a  Markov  process  with  semigroup  (Qt)t> o  with 
respect  to  the  filtration  (^).  It  is  enough  to  prove  that,  for  every  s  >  0,  t  >  0  and 
A  g  ^s,f  g  Cq(E ),  we  have 


E[lAf(Xs+t)]  =  E[\a  Qj(Xs)}- 
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Since  Xs  —  Xs  a.s.  and  Xs+t  =  Xs+t  a.s.,  this  is  equivalent  to  proving  that 

E[lAf(Xs+t)]=E[lAQtf(Xs)]. 

Because  A  is  equal  a.s.  to  an  element  of  we  may  assume  that  A  £  Let 
(sn)  be  a  sequence  in  D  that  decreases  to  s ,  so  that  A  £  AfiSn  for  every  n.  Then,  as 
soon  as  sn  <  s  +  t,  we  have 


E[lAf(Xs+t)]  =  E[lAE\f(Xs+t)  |  &J]  =  E[\a  Qs+t-Snf(XSn)\. 


But  Qs+t-sJ  converges  (uniformly)  to  Qtf  by  properties  of  Feller  semigroups,  and 
since  XSn  —  XSn  a.s.  we  also  know  that  XSn  converges  a.s.  to  Xs  —  Xs  a.s.  We  thus 
obtain  the  desired  result  by  letting  n  tend  to  oo. 

It  remains  to  verify  that  the  sample  paths  t  i->  Xt(co)  are  cadlag  as  TLvalued 
mappings,  and  not  only  as  -valued  mappings  (we  already  know  that,  for  every 
fixed  t  ^  0,  Xf{(ii)  —  Xt(o))  a.s.  is  in  E  with  probability  one,  but  this  does  not  imply 
that  the  sample  paths,  and  their  left-limits,  remain  in  E).  Fix  a  function  g  e  C{ q"  (E) 
such  that  g(v)  >  0  for  every  x  e  E.  The  function  h  —  R\g  then  satisfies  the  same 
property.  Set,  for  every  t  >  0, 


Yt  =  e“r h(Xt] ). 

Then  Lemma  6.6  shows  that  (Yt)t> o  is  a  nonnegative  supermartingale  with  respect 
to  the  filtration  (^t).  Additionally,  we  know  that  the  sample  paths  of  (T?)/>o  are 
cadlag  (recall  that  h(A)  —  0  by  convention). 

For  every  integer  n  >  1 ,  set 


T(n)  =  inf{t  >  0  :Yt< 


1 

n 


}• 


Then  is  a  stopping  time  of  the  filtration  (J^y)  (we  can  apply  Proposition  3.9, 
because  T(n)  is  the  first  hitting  time  of  an  open  set  by  an  adapted  process  with  cadlag 
sample  paths,  and  the  filtration  (J^y)  is  right-continuous).  Consequently, 


T  =  lim  t  T(n) 


is  a  stopping  time.  The  desired  result  will  follow  if  we  can  verify  that  P(T  <  oo)  = 
0.  Indeed,  it  is  clear  that,  for  every  t  £  [0,  7^)),  Xt  £  E  and  Xt-  £  E ,  and  we  may 
redefine  Xt(co)  =  xo  (for  every  t  >  0)  for  all  co  belonging  to  the  event  {T  <  oo} 
£  JY . 
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To  verify  that  P(T  <  oo)  =  0,  we  apply  Theorem  3.25  and  the  subsequent 
remark  to  Z  =  Y  and  U  —  T(n),  V  =  T  +  q,  where  q  >  0  is  a  rational  number.  We 
get 


E[YT+q  l{r<oo}]  —  E[Yj{n)  l{r(n)<0o}]  —  ^  • 
By  letting  n  tend  to  oo,  we  thus  have 


E[Yr+q  l{r<oo}]  —  0, 


hence  YT+q  =  0  a.s.  on  {T  <  oo}.  By  the  right-continuity  of  sample  paths  of  Y,  we 
conclude  that  Yt  =  0,  for  every  t  G  [T,  oo),  a.s.  on  {T  <  oo}.  But  we  also  know 
that,  for  every  integer  k  >  0,  Y^  =  o~kh(Xk)  >  0  a.s.,  since  Xk  G  E  a.s.  This  suffices 
to  get  P(T  <  oo)  =  0.  □ 

Remark  The  previous  proof  applies  with  minor  modifications  to  the  different 
setting  where  we  are  given  the  process  (Xr)r>o  together  with  a  collection  ( Px)xeE 
of  probability  measures  such  that,  under  Px,  ( Xt)t>o  is  a  Markov  process  with 
semigroup  (Qt)t> o,  with  respect  to  a  filtration  (J^i)n e[o,oo]»  and  Px(X o  =  x)  =  1 
(in  the  first  section  above,  we  saw  that  these  properties  will  hold  for  the  canonical 
process  (Xr)r>o  on  the  space  =  ER+  if  the  measures  Px  are  constructed  from 
the  semigroup  (Qt)t> o  using  the  Kolmogorov  extension  theorem).  In  that  setting,  we 
can  define  the  filtration  o,<x>]  by 

&t  =  & t+  v 

where  JY'  denotes  the  class  of  all  -measurable  sets  that  have  zero  ^-probability 
for  every  x  e  E.  By  the  same  arguments  as  in  the  preceding  proof,  we  can  then 
construct  an  (J^)- -adapted  process  (W)?>o  with  cadlag  sample  paths,  such  that,  for 
every  v  G  E, 


Px(Xt  =  Xt)  =  1,  Vt>  0, 

and  (Xt)t>o  is  under  Px  a  Markov  process  with  semigroup  (Qt)t> o,  with  respect  to 
the  filtration  {^t)te[ o,oo]»  such  thatPx(Xo  =  x)  =  l. 


6.4  The  Strong  Markov  Property 

In  the  first  part  of  this  section,  we  come  back  to  the  general  setting  of  Sect.  6.1 
above,  where  ( Qt)t>o  is  a  (not  necessarily  Feller)  transition  semigroup  on  E.  We 
assume  here  that  E  is  a  metric  space  (equipped  with  its  Borel  a-field),  and  moreover 
that,  for  every  x  e  E,  one  can  construct  a  Markov  process  (Xf)t>0  with  semigroup 
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(Qt)t> o  such  that  Xq  =  v  a.s.  and  the  sample  paths  of  X  are  cadlag.  In  the  case  of 
a  Feller  semigroup,  the  existence  of  such  a  process  follows  from  Corollary  6.4  and 
Theorem  6.15. 

The  space  of  all  cadlag  paths/  :  M+  — >  E  is  denoted  by  HD  (is).  This  space  is 
equipped  with  the  cr-field  S)  generated  by  the  coordinate  mappings/  i->  f(t).  For 
every  v  G  E,  we  write  Pv  for  the  probability  measure  on  D(Zs)  which  is  the  law  of 
the  random  path  (X/fi>o.  Notice  that  ¥x  does  not  depend  on  the  choice  of  Xx ,  nor  of 
the  probability  space  where  Xx  is  defined:  This  follows  from  the  fact  that  the  finite¬ 
dimensional  marginals  of  a  Markov  process  are  determined  by  its  semigroup  and 
initial  distribution. 

We  first  give  a  version  of  the  (simple)  Markov  property,  which  is  a  simple 
extension  of  the  definition  of  a  Markov  process.  We  use  the  notation  Ex  for  the 
expectation  under  P_v. 

Theorem  6.16  (Simple  Markov  property)  Eet  (Yt)t> o  be  a  Markov  process  with 
semigroup  (Qt)t>  o>  with  respect  to  the  filtration  (J^ t)t>o •  assume  that  the  sample 
paths  of  Y  are  cadlag.  Let  s  >  0  and  let  0  :  B(£)  — >  M+  be  a  measurable 
function.  Then, 


E[<P((Ys+t)t> 0)  |  &s]  =  EYs[0]. 


Remark  The  right-hand  side  of  the  last  display  is  the  composition  of  Ys  and  of  the 
mapping  y  i->  E y[<P].  To  see  that  the  latter  mapping  is  measurable,  it  is  enough  to 
consider  the  case  where  (P  —  l a,  A  e  Tfi.  When  A  only  depends  on  a  finite  number 
of  coordinates,  there  is  an  explicit  formula,  and  an  application  of  the  monotone  class 
lemma  completes  the  argument. 

Proof  As  in  the  preceding  remark,  it  suffices  to  consider  the  case  where  0  =  1  A 
and 


A  =  {/  e  B(E)  :f(h)  eBu...  ,f(tp )  e  Bp}, 

where  0  <  t\  <  t2  <•  -  -  <  tp  and  B\, ...  ,BP  are  measurable  subsets  of  E.  In  that 
case,  we  need  to  verify  that 


P(Xs+t\  £  B i, ... ,  Ys+t  e  Bp  | 


Jb  i 


=  Q'M.dxd  /  Qt2-n(x  i,dx2) 


/ 

Jb2 


l 


Qtp—tp-i  (xp—i  >  dx/;) 


We  in  fact  prove  more  generally  that,  if  (p\ , . . . ,  cpp  e  B(E), 


E&dYs+n)  ■  •  •  <PP(Ys+tp)  I  ^s] 

=  J  Qti(Ys,fcl)<P\(X\)  J  Qi-,-1,  (t| ,  dX2)<P2(X2)  "  /  Vp-\ ,  dxp)<pp(xp) 
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If  p  —  1  this  is  the  definition  of  a  Markov  process.  We  then  argue  by  induction, 
writing 

E[<pi(Ys+ti)  *  *  ’ (PpiXs+tp)  I  ^ ?] 

=  E\(p\(Ys+tl) ' '  *  <Pp—  l (Ys+tp-i)  E[(pp(Ys+tp)  |  EE s-\.tp_x\  |  EE y] 

=  E[(pi(Ys-\-ti)  *  *  *  typ—  l (Xs+tp—i)  Qtp—tp-\(Pp(¥ s+tp-\)  I  <Es\ 


and  the  desired  result  easily  follows.  □ 

We  now  turn  to  the  strong  Markov  property. 

Theorem  6.17  (Strong  Markov  property)  Retain  the  assumptions  of  the  previous 
theorem,  and  suppose  in  addition  that  (Qt)t> o  is  a  Feller  semigroup  ( in  particular, 
E  is  assumed  to  be  metrizable  locally  compact  and  countable  at  infinity).  Let  T  be 
a  stopping  time  of  the  filtration  (EEt+),  and  let  0  :  HD  (Is)  — >  M+  be  a  measurable 
function.  Then,  for  every  x  G  E, 

£[l{r<oo}^((fY+/)f>o)  |  EET\  —  l{r<oo}Eyr[0]. 

Remark  We  allow  T  to  be  a  stopping  time  of  (J^>+),  which  is  slightly  more  general 
than  saying  that  T  is  a  stopping  time  of  the  filtration  (J^). 

Proof  We  first  observe  that  the  right-hand  side  of  the  last  display  is  ^-measurable, 
because  {T  <  oo}  9  a)  ^  YT(co)  is  J^-measurable  (Theorem  3.7)  and  the  function 
y  i->  E  y[(p]  is  measurable.  It  is  then  enough  to  show  that,  for  A  e  EEt  fixed, 

^[lAn{r«x>}^((^r+t)t>o)]  —  £[lAn{r<oo}Eyr[0]]. 

As  above,  we  can  restrict  our  attention  to  the  case  where 

&(f)  =  <Pl  (fit  l))  •  •  •  (Ppifitp )) 

where  0  <  t\  <  E  <  -  <  tp  and  <p\, ...  ,<pp  e  B(E).  It  is  in  fact  enough  to  take 

p  —  1 :  If  the  desired  result  holds  in  that  case,  we  can  argue  by  induction,  writing 

E[\An{T<oo}V\(YT+ti)  •  m  m  <Pp(Yr+tp)\ 

=  E[\An{T<0o}(p\(YT+tl)  •  •  •(pp-\{YT+tp-i)E[(pp{YT+tp)  I  ^T+tp-i]] 

=  E[lAn{T<oo}<Pi(YT+ti)  •  -  - (Pp-iiYT+tp-^Qtp-tp-iVpiYr+tp-i)]- 

We  thus  fix  t  >  0  and  cp  e  B(E )  and  we  aim  to  prove  that 


^[lAn{r«x>}^(^r+t)]  —  E[lAn{T<00}Qt(p(YT)]\. 


170 


6  General  Theory  of  Markov  Processes 


We  may  assume  that  (p  G  Co  (is)  (a  finite  measure  on  E  is  determined  by  its  values 
against  functions  of  Co  (is)). 

On  the  event  T  <  oo,  write  [T\n  for  the  smallest  real  number  of  the  form  i2~n , 
i  G  Z+ ,  which  is  strictly  greater  than  T.  Then, 

E[lAr\{T<oo}V(YT+t)\  =  lim  E[lAn{T<00}(p(Y[T]n+t)\ 

n-+oo 

oo 

—  1™  Jy^JE[lAn{(i-i)2-n<T<i2-n}<p(Yj2-n+t)] 

n — >-oo  “ 

i=  1 

oo 

—  lim  Jy^JE[lAn{(i-l)2-n<T<i2-,1}Qt(p(Yi2-n)] 

i=  1 

=  lim  E[lAn{T<oo)Qt(p(Y[n,)] 

n — >-oo 

=  E[lAn{T<00}Qt(p(YT)\ 

giving  the  desired  result.  In  the  first  (and  in  the  last)  equality,  we  use  the  right 
continuity  of  sample  paths.  In  the  third  equality,  we  observe  that  the  event 

A  n  {/  -  l)2~n  <  T  <  i2~n } 

belongs  to  ~n  because  A  g  and  T  is  a  stopping  time  of  the  filtration  (YPt+) 
(use  Proposition  3.6).  Finally,  and  this  is  the  key  point,  in  the  last  equality  we  also 
use  the  fact  that  Qtcp  is  continuous,  since  ip  G  Cq(E)  and  the  semigroup  is  Feller.  □ 

Remark  In  the  special  case  of  (linear)  Brownian  motion,  the  result  of  Theorem  6.17 
is  essentially  equivalent  to  Theorem  2.20  stated  in  Chap.  2.  The  reason  why  the 
formulation  in  Theorem  2.20  looks  different  comes  from  the  property  of  stationarity 
and  independence  of  the  increments  of  Brownian  motion,  which  of  course  does 
not  subsist  in  our  general  setting.  Even  for  Brownian  motion,  the  formulation 
of  Theorem  6.17  turns  out  to  be  more  appropriate  in  a  number  of  situations:  A 
convincing  illustration  is  the  proof  of  Proposition  7.7  (ii)  in  the  next  chapter. 


6.5  Three  Important  Classes  of  Feller  Processes 
6.5.1  Jump  Processes  on  a  Finite  State  Space 

In  this  subsection,  we  assume  that  the  state  space  E  is  finite  (and  equipped  with  the 
discrete  topology).  Note  that  any  cadlag  function/  g  ID  (is)  must  be  of  the  following 
type:  There  exists  a  real  t\  g  (0,  00]  such  that//)  =  /( 0)  for  every  t  g  [0,  t\),  then, 
if  t\  <  00,  there  exists  a  real  1 2  g  (t\,  00]  such  that//)  =  //1)  ^  /( 0)  for  every 
t  G  [t\,  t2 ),  and  so  on. 
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Fig.  6.1  A  sample  path  of  the  jump  process  X  under  Px 


Consider  a  Feller  semigroup  (Qt)t> o  on  E.  By  the  remark  of  the  end  of  Sect.  6.3, 
we  can  construct,  on  a  probability  space  Q  equipped  with  a  right-continuous 
filtration  (&t)te[ o,<x>]>  a  collection  (. Px)xeE  of  probability  measures  and  a  process 
(Xt)t>o  with  cadlag  sample  paths  such  that,  under  Px,  X  is  Markov  with  semigroup 
(Qt)t> o  with  respect  to  the  filtration  and  Px(X o  =  v)  =  1.  As  previously, 
Ex  stands  for  the  expectation  under  Px.  Since  the  sample  paths  of  X  are  cadlag,  we 
know  that,  for  every  co  G  £2,  there  exists  a  sequence 


Tq((jl>)  =  0  <  T\  ( 00 )  <  T2{co)  <  T^ico)  <  •  •  •  <  oo, 


such  that  Xt(co)  —  Xq (co)  for  every  t  e  [0,  T\{co))  and,  for  every  integer  i  >  1, 
the  condition  Ti(co)  <  oo  implies  Ti(co)  <  Ti+\(a ;),  XTi{co){(p)  and 

Xt(oL>)  —  XTi(a))((o)  for  every  t  G  [Ti(<jo),  Moreover,  Tn(co)  \  oo  as  n  -> 

oo.  See  Fig.  6.1. 

It  is  not  hard  to  verify  that  To,  T\,  T2, . . .  are  stopping  times.  For  instance, 

m  <  ?}  =  u  {xq  ±  x0}  e  &t. 

^e[0,?)nQ 

Recall  that,  for  A  >  0,  a  positive  random  variable  U  is  exponentially  distributed 
with  parameter  A  if  P(U  >  r )  =  e_Ar  for  every  r  >  0.  In  the  following  lemma,  we 
make  the  convention  that  an  exponential  variable  with  parameter  0  is  equal  to  00 
a.s. 

Lemma  6.18  Let  x  G  E.  There  exists  a  real  number  q(x)  >  0  such  that  the  random 
variable  T\  is  exponentially  distributed  with  parameter  q(x)  under  Px.  Furthermore, 
if  q(x)  >  0,  T\  andXTl  are  independent  under  Px. 
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Proof  Let  s,t  >  0.  We  have 


PX(TX  >  S  +  t)  —  Ex[1{Ti>s}  ^(PCs+r)r>o)]> 

where  0(f)  =  l^r)=^0),  Vre[o,r]}  for/  G  B(is).  Using  the  simple  Markov  property 
(Theorem  6.16),  we  get 

PX(TX  >  s  +  t)  =  Ex[1{t1>s}  Exs[@(fXr)r>o]\ 

=  Ex[1{tx>s}Px(Tx  >  0] 

=  Px(T\  >  s)Px(Tl  >  t), 

which  implies  that  7j  is  exponentially  distributed  under  Px. 

Assume  that  q(x)  >  0,  so  that  7j  <  oo,  Px  a.s.  Then,  for  every  t  >  0  andy  G  E , 

Px(T 1  >  6  W|  —  y)  —  Ex[l{Ti>t}  ^(PQ+r)r> o)]» 

where  for/  G  B(is),  0(f)  —  0  is  /  is  constant,  and  otherwise  *£//)  =  l{yi(y)=};},  if 
Y\  (f)  is  the  value  of/  after  its  first  jump.  We  thus  get 


PX(T\  >  t ,  W,  —  y)  —  £%[l{ri>?}  ^x?[^((W)r>o)]] 

=  ^v[l{ri>?}  PxO^Ti  —  t)] 

=  W  >t)Px(XTl  —  y), 


which  gives  the  desired  independence.  □ 

Points  v  such  that  q(x)  —  0  are  absorbing  states  for  the  Markov  process,  in  the 
sense  that  Px(Xt  —  x,  Vf  >  0)  =  1. 

For  every  x  e  E  such  that  q(x)  >  0,  and  every  y  G  £,  we  set 

n(x,y)  =  Px(XTl  =  y). 

Note  that  TJ(x ,  •)  is  a  probability  measure  on  E ,  and  Et(x,  v)  =  0. 

Proposition  6.19  Let  L  denote  the  generator  of  (Qt)t> o.  Then  D(L)  —  C$(E)  — 
B(E),  and,  for  every  cp  G  B(E),for  every  x  G  E: 

•  if  q(x)  —  0,  L<p(x)  —  0; 

•  ifq(x)  >  0, 

Lcp(x)  =  q(x)  yy  n(x,y)(cp(y)  —  cp{x))  =  ^L(x,y)^(y), 

yeE,y^x  y^E 
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where 


L(x,  y) 


q(x)n(x,y )  ify  A  x, 
-q(x)  ify  =  x. 


Proof  Let  ip  e  B(E)  and  x  e  E.  If  q(x)  —  0,  it  is  trivial  that  Qt(p{x )  =  <p(x)  and  so 


lim 

40 


QMx)  -  cp(x) 

t 


Suppose  then  that  q(x)  >  0.  We  first  observe  that 

PAT2  <t)  =  0(A) 


(6.1) 


as  t  ->  0.  Indeed,  using  the  strong  Markov  property  at  7), 

PxiJ 2  <  0  —  Px(Ti  <  t,  T2  <  T\  -\- 1)  =  Ex[l{Tl<tj  PxTl  (T\  <  f)], 
and  we  can  bound 


Pxri  (Ti  <  t)  <  supP^CLi  <t)<t  sup^Cy), 

yeE  yeE 

giving  the  desired  result  since  we  have  also  PX(T\  <  t)  <  g(v)f. 

It  follows  from  (6.1)  that 

Qt<f>(x)  =  Ex[<p(Xt)]  =  Ex[cp(Xt )  lpcrj}]  +  l{ri</}]  +  0(t~) 

=  <p(x)  e~q(x)t  +  (1  -  e~q{x)t)  J2  n^y)  <P(y)  + 

yeE,y^x 


using  the  independence  of  T\  and  XTl  and  the  definition  of  n(x,y).  We  conclude 
that 


Qt<p(x)  ~  <p(x) 

t 


—y  q(x)ip(x)  +  q(x)  V  II(x,y)<p(y), 

yeE,y^x 


and  this  completes  the  proof. 

In  particular,  taking  cp(y)  =  1^},  we  have  if  y  ^  x. 


L(x,  y) 


Px(Xt  =  y)|,=o, 


n 


so  that  L(x,  y)  can  be  interpreted  as  the  instantaneous  rate  of  transition  from  v  to  y. 
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The  next  proposition  provides  a  complete  description  of  the  sample  paths  of  X 
under  Px.  For  the  sake  of  simplicity,  we  assume  that  there  are  no  absorbing  states, 
but  the  reader  will  easily  extend  the  statement  to  the  general  case. 

Proposition  6.20  We  assume  that  q(y)  >  0  for  every  y  e  E.  Let  x  e  E.  Then, 
Px  a.s.,  the  jump  times  T\  <  T2  <  <  •••  are  all  finite  and  the  sequence 

Xo,  Xjx ,  Xj2 , ...  is  under  Px  a  discrete  Markov  chain  with  transition  kernel  T1  started 
from  x.  Furthermore,  conditionally  on  (Xo,Xjx  ,  Xj2, . . .),  the  random  variables 
T\  —  To,  T2  —  T\, . . .  are  independent  and,  for  every  integer  i  >  0,  the  conditional 
distribution  ofTt+i  —  7)  is  exponential  with  parameter  qiXpfi. 

Proof  An  application  of  the  strong  Markov  property  shows  that  all  stopping  times 
T\,  72, . . .  are  finite  Px  a.s.  Then,  let  y,z  £  E ,  and/1,/2  e  5(M+).  By  the  strong 
Markov  property  at  T\, 

Ex[l{xTl=y}fi(Ti)  1{xT2=zMt2  -  TO] 

=  Ex[  1{Xti  =y}fi  ( toeXti  [1{Xti  =zMTi)]] 

p OO  p OO 

=  n(x,y)n(y,z)  dsie-^'MsO  d  s2e-^f2(s2). 

Jo  Jo 

Arguing  by  induction,  we  get  for  every  yu  ...  ,yp  e  E  and/i , ...  ,fp  e  5(M+), 

Ex[l{xTi  =y]}l{XT2=y2}  •  •  •  l{XTp=yp}MTl)f2(T2  -  T\)  •  •  fip(Tp  -  Tp- 0] 

p  / 

=  n(x,yi)n(yi,y2)  •  •  •  n(yp-i,yp)  FT  (  /  d se~q(yi~l) 

i=  1 

where  yo  =  x  by  convention.  The  various  assertions  of  the  proposition  follow.  □ 

Jump  processes  play  an  important  role  in  various  models  of  applied  probability, 
in  particular  in  reliability  and  in  queueing  theory.  In  such  applications,  one  usually 
starts  from  the  transition  rates  of  the  process.  It  is  thus  important  to  know  whether, 
given  a  collection  ( q(x))xeE  of  nonnegative  real  numbers  and,  for  every  v  such 
that  q(x)  >  0,  a  probability  measure  TJ{x,  •)  on  E  such  that  TJ{x,x)  —  0, 
there  exists  a  corresponding  Feller  semigroup  (Qt)t> 0  and  therefore  an  associated 
Markov  process.  The  answer  to  this  question  is  yes,  and  one  can  give  two  different 
arguments: 

•  Probabilistic  method.  Use  the  description  of  Proposition  6.20  (or  its  extension 
to  the  case  where  there  are  absorbing  states)  to  construct  the  process  (Xt)t> 0 
starting  from  any  x  e  E,  and  thus  the  semigroup  (Qt)t>o  via  the  formula 
Qt<p(x)  =  Ex[(p(Xt)\. 
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•  Analytic  method.  Define  the  generator  L  via  the  formulas  of  Proposition  6.19, 
and  observe  that  the  semigroup  (Qt)t> o,  if  it  exists,  must  solve  the  differential 
equation 


—QAx.y)  =  QMx,y) 
at 

by  Proposition  6.11.  This  leads  to 

Qt  =  exp(rL), 

in  the  sense  of  the  exponential  of  matrices.  Since  Aid  +  L  has  nonnegative 
entries  if  A  >  0  is  large  enough,  one  immediately  gets  that  Qt  has  nonnegative 
entries.  Writing  1  for  the  vector  (1,1,.. .,1),  the  property  LI  =  0  gives 
Qtl  —  1,  so  that  ( Qt(x ,  -))xeE  defines  a  transition  kernel.  Finally,  the  property 
exp((s  +  t)  L)  —  exp(sL)  exp(fL)  gives  the  Chapman-Kolmogorov  property, 
and  we  get  that  (Qt)t> o  is  a  transition  semigroup  on  E ,  whose  Feller  property  is 
also  immediate. 

Many  of  the  preceding  results  can  be  extended  to  Feller  Markov  processes  on  a 
countable  state  space  E.  Note,  however,  that  certain  difficulties  arise  in  the  question 
of  the  existence  of  a  process  with  given  transition  rates.  In  fact,  starting  from  the 
probabilistic  description  of  Proposition  6.20,  one  needs  to  avoid  the  possibility  of 
an  accumulation  of  jumps  in  a  finite  time  interval,  which  may  occur  if  the  rates 
(q(y),y  C  E)  are  unbounded  -  of  course  this  problem  does  not  occur  when  E  is 
finite. 


6.5.2  Levy  Processes 

Consider  a  real  process  (T/)/>o  which  satisfies  the  following  three  assumptions: 

(i)  To  —  0  a.s. 

(ii)  For  every  0  <  s  <  t,  the  variable  Yt  —  Ys  is  independent  of  (Yr,  0  <  r  <  s)  and 
has  the  same  law  as  Yt-S. 

(iii)  Yt  converges  in  probability  to  0  when  t  0. 

Two  special  cases  are  real  Brownian  motion  (started  from  0)  and  the  process 
(Ta)a> o  of  hitting  times  of  a  real  Brownian  motion  (cf.  Exercise  2.26). 

Notice  that  we  do  not  assume  that  sample  paths  of  Y  are  cadlag,  but  only  the 
weaker  regularity  assumption  (iii).  The  preceding  theory  will  allow  us  to  find  a 
modification  of  Y  with  cadlag  sample  paths. 

For  every  t  >  0,  we  denote  the  law  of  Yt  by  <2/(0,  dy),  and,  for  every  v  e  R,  we 
let  Qt(x,  dy)  be  the  image  of  <2/(0,  dy)  under  the  translation  y  ^  i  +  y. 
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Proposition  6.21  The  collection  (Qt)t> o  is  a  Feller  semigroup  on  R.  Furthermore, 
(Yt)t> o  is  a  Markov  process  with  semigroup  {Qt)t> o- 

Proof  Let  us  show  that  (Qt)t>o  is  a  transition  semigroup.  Let  ip  G  Z?(R),  s,  t  >  0  and 
xel  Property  (ii)  shows  that  the  law  of  ( Yt ,  Yt+S  —  Yt )  is  the  product  probability 
measure  <2/(0,  •)  (8)  £?y(0,  •)•  Hence, 


Qs(y,dz)<p(z) 


&(0,  d  z)<p(x  +  y  +  z) 


=  E[(p(x  +  Y,  +  (Yt+S  -  Y,))} 


=  E[<p{x  +  y,+s)] 


J  Qt+S(x,dz)<p(z) 


giving  the  Chapman-Kolmogorov  relation.  We  should  also  verify  the  measurability 
of  the  mapping  (t,x)  \->  Qt(x,A ),  but  this  will  follow  from  the  stronger  continuity 
properties  that  we  will  establish  in  order  to  verify  the  Feller  property. 

Let  us  start  with  the  first  property  of  the  definition  of  a  Feller  semigroup.  If 
(p  G  Cb(R),  the  mapping 


X  ^  Qt<p(x)  =  E[<p(x  +  Yt)\ 

is  continuous  by  dominated  convergence,  and,  again  by  dominated  convergence,  we 
have 


E[<p(x+Yt)]  0 

x-+oo 

showing  that  Qtcp  e  Co(R).  Then, 

Qt<p(x)  =  E[(p(x  +  Y,)]  — »  <p(x) 

t~>  0 

thanks  to  property  (iii).  The  uniform  continuity  of  cp  even  shows  that  the  latter 
convergence  is  uniform  in  v.  This  completes  the  proof  of  the  first  assertion  of  the 
proposition.  To  get  the  second  one,  we  write,  for  every  s,  t  >  0  and  every  ip  G  B(W), 

E[<p(Ys+t)  |rr,0<r<s]  =  E[<p(Ys  +  (Fs+,  -  YJ)  |Fr,0<r<^] 

-  j  <p(Ys  +  y )  0,(0,  d y) 

=  J  <P(}’)  Qt(Ys,dy), 

using  property  (ii)  and  the  definition  of  2/(0,  •)  in  the  second  equality.  □ 
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It  then  follows  from  Theorem  6. 15  that  there  exists  a  modification  of  (Yt)t>o  with 
cadlag  sample  paths.  Obviously  this  modification  still  satisfies  (i)  and  (ii). 

A  Levy  process  is  a  process  satisfying  properties  (i)  and  (ii)  above,  and  having 
cadlag  sample  paths  (which  implies  (iii)).  We  refer  to  [3]  for  a  thorough  account  of 
the  theory  of  Levy  processes. 


6.5.3  Continuous-State  Branching  Processes 

A  Markov  process  (Xt)t>o  with  values  in  E  —  M+  is  called  a  continuous- state 
branching  process  if  its  semigroup  (Qt)t>  o  satisfies  the  following  property:  for  every 
v,  y  e  R+  and  t  >  0, 


Qt(x,  •)  *  Qt(y ,  •)  =  Qt(x  +  y,  •), 

where  p  *  v  denotes  the  convolution  of  the  probability  measures  pi  and  v  on  M+ . 
Note  that  this  implies  2/(0,  •)  =  £o  for  every  t  >  0. 

Exercise  Verify  that,  if  X  and  X'  are  two  independent  continuous-state  branching 
processes  with  the  same  semigroup  (Qt)t> o,  then  ( Xt  +  X')*>o  is  also  a  Markov 
process  with  semigroup  (Qt)t> o-  This  is  the  so-called  branching  property:  compare 
with  discrete  time  Galton- Watson  processes. 

Let  us  fix  the  semigroup  (Qt)t>o  of  a  continuous- state  branching  process,  and 
assume  that: 

(i)  &(*,{0})  <  1  for  every  v  >  0  and  t  >  0; 

(ii)  Qt(x,  •)  — >  <5X(-)  when  t  ->  0,  in  the  sense  of  weak  convergence  of  probability 
measures. 

Proposition  6.22  Under  the  preceding  assumptions,  the  semigroup  (Qt)t> o  is 
Feller.  Furthermore,  for  every  A  >  0,  and  every  x  >  0, 

J  Q,(x,  dy)  e~A-v  = 

where  the  functions  ft  •  (0,  oo)  — >  (0,  oo)  satisfy  ft  °  fs  =  ft+s  for  every  s,  t  >  0. 

Proof  Let  us  start  with  the  second  assertion.  If  x,y  >  0,  the  equality  Qt(x,  •)  * 
Qt(y,  0  =  Qt(x  +  y,  0  implies  that 

(  J  Qt(x-  dz)  J  Q,(y,  dz)  =  j  Q,(x  +  y,  dz)  e_Az. 
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Thus  the  function 


log  (  J  Qi(x,  dz)  e  Xz^ 


is  nondecreasing  and  linear  on  R+,  hence  of  the  form  A)  for  some  constant 
f,(X)  >  0  (the  case  \j/t( A)  =  0  is  excluded  by  assumption  (i)).  To  obtain  the  identity 
ft°fs  =  ft+s,  we  write 

j  Qt+S(x,  dz)  e~Xz  —  J  Q,(x.  dy)  J  Qs(y,dz)e~Xz 


=  J  Qt(x, 


dy)  e 


-yfs(^) 


We  still  have  to  prove  that  the  semigroup  is  Feller.  For  every  A  >  0,  set  <px(pc)  — 
q~Xx.  Then, 


Qt<Px  —  <Ptyt{X)  G  C0(M+). 


Furthermore,  an  application  of  the  Stone- Weierstrass  theorem  shows  that  the  vector 
space  generated  by  the  functions  cpx,  A  >  0,  is  dense  in  Cb(M+).  It  easily  follows 
that  Qtcp  e  Co(M+)  for  every  cp  G  Co(M+). 

Finally,  if  ip  G  Cq(M+),  for  every  v  >  0, 


Qt{x,dy)(p(y) 


— t  <^(x) 

t-+0 


by  assumption  (ii).  Using  a  remark  following  the  definition  of  Feller  semigroups, 
this  suffices  to  show  that  \\Qt<p  —  <p ||  — >  0  when  t  — >  0,  which  completes  the  proof. 

□ 

Example  For  every  t  >  0  and  every  v  >  0,  define  Qt(x,dy)  as  the  law  of 
Ci  +  e2  +  •  •  •  +  e#,  where  ei,e2,...  are  independent  random  variables  with 
exponential  distribution  of  parameter  1/t,  and  N  is  Poisson  with  parameter  x/t,  and 
is  independent  of  the  sequence  (e/).  Then  a  simple  calculation  shows  that 

J  Qt(x,  dy)  e-A>’  =  e~xMX) 


where 


f,(X) 


I  +  A  t' 
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Noting  that  \j/t  o  yfrs  —  \lrt+s,  we  obtain  that  (Qt)t>o  satisfies  the  Chapman- 
Kolmogorov  identity,  and  then  that  (Qt)t> o  is  the  transition  semigroup  of  a 
continuous- state  branching  process.  Furthermore,  (Qt)t> o  satisfies  assumptions  (i) 
and  (ii)  above.  In  particular,  the  semigroup  (Qt)t> o  is  Feller,  and  one  can  construct 
an  associated  Markov  process  (X?/>o  with  cadlag  sample  paths.  One  can  in  fact 
prove  that  the  sample  paths  of  (X/fi>o  are  continuous,  and  this  process  is  called 
Feller’s  branching  diffusion,  see  Sect.  8.4.3  below. 


Exercises 


Exercise  6.23  ( Reflected  Brownian  motion)  We  consider  a  probability  space 
equipped  with  a  filtration  (JPt)te[ o,oo]-  Let  a  >  0  and  let  B  =  (Bt)t> o  be  an 
(J^t) -Brownian  motion  such  that  Bq  —  a.  For  every  t  >  0  and  every  z  €  R,  we  set 


Ml)  =  vfe  exp'"S) 


1.  We  set  Xt  —  \Bt\  for  every  t  >  0.  Verify  that,  for  every  s  >  0  and  t  >  0,  for  every 
bounded  measurable  function/  :  R+  — >  R, 


E\f{Xs+t)\^s\  =  Q,f(Xs), 


where  Qcf  —  f  and,  for  every  t  >  0,  for  every  x  >  0, 

poo 

Qtfix )  =  /  (pt(y  -x)+  pt{y  +  xj)f(y)  dy. 

Jo 

2.  Infer  that  (Qt)t>o  is  a  transition  semigroup,  then  that  is  a  Markov  process 

with  values  in  E  —  R+,  with  respect  to  the  filtration  (J^),  with  semigroup 

(Qt)t>  o- 

3.  Verify  that  (Qt)t> o  is  a  Feller  semigroup.  We  denote  its  generator  by  L. 

4.  Let  /  be  a  twice  continuously  differentiable  function  on  R+ ,  such  that  /  and 
f"  belong  to  Cb(R+).  Show  that,  if  f(0)  =  0,/  belongs  to  the  domain  of  L, 
and  Lf  —  \f" •  (Hint:  One  may  observe  that  the  function  g  :  R  ->  R  defined 
by  &(y)  —  /(|y|)  is  then  twice  continuously  differentiable  on  R.)  Show  that, 
conversely,  iff  (0)  f  0,f  does  not  belong  to  the  domain  of  L. 

Exercise  6.24  Let  (Qt)t> o  be  a  transition  semigroup  on  a  measurable  space  E.  Let 
7i  be  a  measurable  mapping  from  E  onto  another  measurable  space  F.  We  assume 
that,  for  any  measurable  subset  A  of  F,  for  every  x,y  e  E  such  that  tx(x)  —  n  (y), 
we  have 


Qt(x.  n  1  (A))  =  Qt(y,  tt  ‘(A)), 
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for  every  t  >  0.  We  then  set,  for  every  z  G  F  and  every  measurable  subset  A  of  F , 
for  every  t  >  0, 


Q',{z,A)  =  Qt(x,  7T  '  (A)) 

where  v  is  an  arbitrary  point  of  E  such  that  Tt(x)  —  z.  We  also  set  Q'0(z,  A)  =  1  a(z). 
We  assume  that  the  mapping  ( t ,  z)  i->  Q't(z,A )  is  measurable  on  M+  x  F,  for  every 
fixed  A. 

1.  Verify  that  (<2j)*>o  forms  a  transition  semigroup  on  F. 

2.  Let  be  a  Markov  process  in  E  with  transition  semigroup  (Qt)t> o  with 

respect  to  the  filtration  o-  Set  Yt  —  n(Xt)  for  every  t  >  0.  Verify  that 

{Yt)t>o  is  a  Markov  process  in  F  with  transition  semigroup  o  with  respect 
to  the  filtration  o- 

3.  Let  (Bt)t>o  be  a  J-dimensional  Brownian  motion,  and  set  Rt  =  \Bt\  for  every 
t  >  0.  Verify  that  (F?)r>0  is  a  Markov  process  and  give  a  formula  for  its 
transition  semigroup.  (The  case  d  —  1  was  treated  via  a  different  approach  in 
Exercise  6.23.) 

In  the  remaining  exercises,  we  use  the  following  notation.  (F,  d )  is  a  locally 
compact  metric  space,  which  is  countable  at  infinity,  and  (Qt)t> o  Is  a  Feller 
semigroup  on  E.  We  consider  an  E-valued process  (Xt)t>o  with  cadlag  sample  paths, 
and  a  collection  (Px)xeE  of  probability  measures  on  E,  such  that,  under  Px,  (Xt)t>o 
is  a  Markov  process  with  semigroup  (Qt)t> o>  with  respect  to  the  filtration  (AEt),  and 
Px(X o  =  x)  —  1.  We  write  Lfor  the  generator  of  the  semigroup  (Qt)t> o>  T)(L)  for 
the  domain  of  L  and  R\  for  the  X-resolvent,  for  every  A  >  0. 

Exercise  6.25  ( Scale  function )  In  this  exercise,  we  assume  that  E  —  M+  and  that 
the  sample  paths  of  X  are  continuous.  For  every  v  e  R+,  we  set 

Tx  inf {t  >  0  :  Xt  —  x) 


and 


<p{x)  :=  Px(To  <  oo). 


1.  Show  that,  if  0  <  v  <  y, 


<p{y)  =  <p(x)Py(Tx  <  oo) 


2.  We  assume  that  <p(x)  <  1  and  Pv(sup/>0 X,  —  +oo)  =  1,  for  every  x  >  0.  Show 
that,  if  0  <  x  <  y, 


Px(To  <  Ty )  - 


<p(x)  -  (p(y) 

i  -<p(y) 
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Exercise  6.26  (Fey nman-Kac  formula)  Let  v  be  a  nonnegative  function  in  Co(E). 
For  every  x  e  E  and  every  t  >  0,  we  set,  for  every  ip  G  B(E ), 


Q*(p{x)  =  Ex  <f(X,)  exp  (-  j  v(Xs)  d.v) 


1.  Show  that,  for  every  ip  g  B(E )  and  s,t  >  0,  Q*+S<p  —  Q*  (Q*  <p)- 

2.  After  observing  that 


1  —  exp  (  —  ^  v(Xs)ds>j  =  s:  v(Xs )  exp  ^  —  J  v(Xr )  dr)  ds 


show  that,  for  every  ip  g  B(E ), 


Qt<p  ~  Q*v  =  [  Qs(v  Q*-S(p)  ds. 

Jo 

3.  Assume  that  ip  g  D(L).  Show  that 


d 
d  t 


Q*<P\t=o  =  Lip  -  vip. 


Exercise  6.27  ( Quasi  left-continuity)  Throughout  the  exercise  we  fix  the  starting 
point  x  e  E.  For  every  t  >  0,  we  write  Xt-(co)  for  the  left- limit  of  the  sample  path 
s  i->  Xs(co)  at  t. 

Let  (Tn)n> i  be  a  strictly  increasing  sequence  of  stopping  times,  and  T  =  lim  f 
Tn.  We  assume  that  there  exists  a  constant  C  <  oo  such  that  T  <  C.  The  goal  of  the 
exercise  is  to  verify  that  XT-  =  XT,  Px  a.s. 

1.  Let  /  G  D(L)  and  h  —  Lf.  Show  that,  for  every  n  >  1, 


EMXT)\^Tn\=f(XTn)  +  Ex 


2.  We  recall  from  the  theory  of  discrete  time  martingales  that 


Ex\f(XT)  |  &Tn] 


^  Ex[f(Xr)  |  &T] 

n-+oo 


where 
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Infer  from  question  (1)  that 


EMXt)  I  &t\  =f(Xr- )• 


3.  Show  that  the  conclusion  of  question  (2)  remains  valid  if  we  only  assume  that 
/  G  Co(E ),  and  infer  that,  for  every  choice  off,  g  G  Co(E ), 

Ex\f(XT)g(XT-)]  =  Ex\f{XT.)g{XT. _)]. 

Conclude  that  XT-  —  XT,  Px  a.s. 

Exercise  6.28  ( Killing  operation)  In  this  exercise,  we  assume  that  X  has  continuous 
sample  paths.  Let  A  be  a  compact  subset  of  E  and 

Ta  =  inf{t  >  0  :  X,  G  A}. 

1.  We  set,  for  every  t  >  0  and  every  bounded  measurable  function  on  E , 

Qt<P(x)  =  Ex[cp(Xt)  1  {t<TA}\  ,  Vx  G  E. 

Verify  that  Q*+S<p  =  Q*  (Q*<p),  for  every  s,t  >  0. 

2.  We  set  E  —  (E\A)  U  {A},  where  A  is  a  point  added  to  E\A  as  an  isolated  point. 
For  every  bounded  measurable  function  <p  on  E  and  every  t  >  0,  we  set 

Qt(p(x)  =  Ex[<p(Xt)  1  {t<TA}\  +  Px[Ta  <  t]  <p(A)  ,  if  v  G  E\A 

and  Qtcp(A)  —  <p(A).  Verify  that  (Qt)t> o  is  a  transition  semigroup  on  E.  ( The 
proof  of  the  measurability  of  the  mapping  (t,x)  i-^  Qt(p(x)  will  be  omitted.) 

3.  Show  that,  under  the  probability  measure  Px,  the  process  X  defined  by 


[Xt  if  t<TA 
[A  if  t  >  Ta 


is  a  Markov  process  with  semigroup  (Qt)t>o,  with  respect  to  the  canonical 
filtration  of  X. 

4.  We  take  it  for  granted  that  the  semigroup  (Qt)t>o  is  Feller,  and  we  denote  its 
generator  by  L.  Let /  G  D(L)  such  that /  and  Lf  vanish  on  an  open  set  containing 
A.  Write  /  for  the  restriction  off  to  E\A,  and  consider/  as  a  function  on  E  by 
setting /(A)  =  0.  Show  that /  G  D(L)  and  Lf(x)  =  Lf(x)  for  every  v  G  E\A. 

Exercise  6.29  {Dynkin’ s  formula) 

1.  Let  g  g  Co(£)  and  x  e  E,  and  let  T  be  a  stopping  time.  Justify  the  equality 


l{r<oo}e 


—XT 


e  Xt g(XT+t)  dt 


Ex[1{t<0o}  e~XT R\g(XT)\. 
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2.  Infer  that 


Rxg(x)  =  Ex 


z-x,g(Xt)  df 


+  £*[l{r<oo} e  XT Rxg(Xr)]- 


3.  Show  that,  if  f  e  D(L), 


fix)  =  Ex 


e-A,(A/-L/)(X,)df 


+  £,[l{r<oo}  e~XTf(XT)]. 


4.  Assuming  that  EX[T]  <  oo,  infer  from  the  previous  question  that 


=  EMXt)]  -fix). 


How  could  this  formula  have  been  established  more  directly? 

5.  For  every  s  >  0,  we  set  TByX  =  inf{£  >  0  :  d(x,Xt)  >  s}.  Assume  that  EX[TS  ,}  < 
oo,  for  every  sufficiently  small  s.  Show  that  (still  under  the  assumption/  e  D(L )) 
one  has 


Ef(x)  —  lim 

ef  0 


EAfiXrJ]  -/(*) 
Ex[T£'X] 


6.  Show  that  the  assumption  Ex\Te  x\  <  oo  for  every  sufficiently  small  s  holds  if  the 
point  v  is  not  absorbing,  that  is,  if  there  exists  a  t  >  0  such  that  Qt(x,  {v})  <  1. 
(Hint:  Observe  that  there  exists  a  nonnegative  function  h  G  Q  (E)  which  vanishes 
on  a  ball  centered  at  v  and  is  such  that  Qth(x)  >  0.  Infer  that  one  can  choose 
a  >  0  and  r]  g  (0,  1)  such  that  Px(Tax  >  nt)  <  (1  —  r])n  for  every  integers  >  1.) 


Notes  and  Comments 

The  theory  of  Markov  processes  is  a  very  important  area  of  probability  theory. 
Markov  processes  have  a  long  history  that  would  be  too  long  to  summarize  here. 
Dynkin  and  Feller  played  a  major  role  in  the  development  of  the  theory  (see 
in  particular  Dynkin’ s  books  [20,  21]).  We  limited  our  treatment  to  the  minimal 
material  needed  for  our  later  applications  to  stochastic  differential  equations.  Our 
treatment  of  Feller  processes  is  inspired  by  the  corresponding  chapters  in  [70]  and 
[71].  We  chose  to  focus  on  Feller  semigroups  because  this  special  case  allows 
an  easy  presentation  of  key  notions  such  as  the  generator,  and  at  the  same  time 
it  includes  the  main  examples  we  consider  in  this  book.  The  reader  interested  in 
the  more  general  theory  of  Markov  processes  may  have  a  look  at  the  classical 
books  of  Blumenthal  and  Getoor  [5],  Meyer  [59]  and  Sharpe  [73].  The  idea  of 
characterizing  a  Markov  process  by  a  collection  of  associated  martingales  (in  the 
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spirit  of  Theorem  6.14)  has  led  to  the  theory  of  martingale  problems,  for  which 
we  refer  the  reader  to  the  classical  book  of  Stroock  and  Varadhan  [77].  Martingale 
problems  are  also  discussed  in  the  book  [24]  of  Ethier  and  Kurtz,  which  focuses 
on  problems  of  characterization  and  convergence  of  Markov  processes,  with  many 
examples  and  applications.  Markov  processes  with  a  countable  state  space  are 
treated,  along  with  other  topics,  in  the  more  recent  book  [76]  of  Stroock.  We  refer 
to  the  monograph  [3]  of  Bertoin  for  a  modern  presentation  of  the  theory  of  Levy 
processes. 


Chapter  7 

Brownian  Motion  and  Partial  Differential 
Equations 


In  this  chapter,  we  use  the  results  of  the  preceding  two  chapters  to  discuss 
connections  between  Brownian  motion  and  partial  differential  equations.  After  a 
brief  discussion  of  the  heat  equation,  we  focus  on  the  Laplace  equation  Au  —  0  and 
on  the  relations  between  Brownian  motion  and  harmonic  functions  on  a  domain  of 
W1.  In  particular,  we  give  the  probabilistic  solution  of  the  classical  Dirichlet  problem 
in  a  bounded  domain  whose  boundary  satisfies  the  exterior  cone  condition.  In  the 
case  where  the  domain  is  a  ball,  the  solution  is  made  explicit  by  the  Poisson  kernel, 
which  corresponds  to  the  density  of  the  exit  distribution  of  the  ball  for  Brownian 
motion.  We  then  discuss  recurrence  and  transience  of  d-dimensional  Brownian 
motion,  and  we  establish  the  conformal  invariance  of  planar  Brownian  motion  as  a 
simple  corollary  of  the  results  of  Chap.  5.  An  important  application  is  the  so-called 
skew-product  decomposition  of  planar  Brownian  motion,  which  we  use  to  derive 
several  asymptotic  laws,  including  the  celebrated  Spitzer  theorem  on  Brownian 
windings. 


7.1  Brownian  Motion  and  the  Heat  Equation 

Throughout  this  chapter,  we  let  B  stand  for  a  d-dimensional  Brownian  motion  that 
starts  from  v  under  the  probability  measure  Px,  for  every  v  e  Rd  (one  may  use 
the  canonical  construction  of  Sect.  2.2,  defining  Px  as  the  image  of  Wiener  measure 
W(dw)  under  the  translation  w  i-^  v  +  w).  Then  (Bt)t> 0  is  a  Feller  process  with 
semigroup 


|y-*l2 

it 


)  <p(y)  dy, 
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for  ip  g  B(Rd).  We  write  L  for  the  generator  of  this  Feller  process.  If  /  is  a  twice 
continuously  differentiable  function  on  W 1  such  that  both  /  and  Afr  belong  to 
Co(M^),  then  fs  e  D(L)  and  L\jr  —  \Afr  (see  the  end  of  Sect.  6.2). 

If  ip  g  then,  for  every  fixed  t  >  0,  Qtip  can  be  viewed  as  the  convolution 

of  ip  with  the  C°°  function 


p,(x)  =  (2 nt)  d/2 


)• 


It  follows  that  Qtip  is  also  a  C°°  function.  Furthermore,  if  <p  G  Co  (MJ),  differentia¬ 
tion  under  the  integral  sign  shows  that  all  derivatives  of  Qt<p  also  belong  to  Co(Rd). 
It  follows  that  we  have  Qtip  G  D(L )  and  L(Qtip )  =  ^A(Qtip),  for  every  t  >  0. 

Theorem  7.1  Let  ip  G  Cq(Mj).  For  every  t  >  0  and  x  G  W i,  set 


ut(x)  =  Qt<p(x)  =  £'JC[(^(^)]- 


Then,  the  function  (ut(x))t>0xeRd  solves  the  partial  differential  equation 


dt 


on  (0,  oo)  x  W1.  Furthermore,  for  every  x  G  Wl, 


lim  us(y)  =  99  (x). 
sfO 


Proof  By  the  remarks  preceding  the  theorem,  we  already  know  that,  for  every 
t  >  0,  ut  is  a  C°°  function,  ut  G  D(L),  and  Lz^  =  |z\z^.  Let  £  >  0.  By  applying 
Proposition  6.11  to/  =  u£ ,  we  get  for  every  t  >  e, 


ds. 


Since  Lus  =  Qs-s(Lus)  depends  continuously  on  s  G  [e,  00),  it  follows  that,  for 
t  >  e, 


dut 

dt 


The  last  assertion  is  just  the  fact  that  Qsip  — >  ip  as  s  0.  □ 

Remark  We  could  have  proved  Theorem  7. 1  by  direct  calculations  from  the  explicit 
form  of  Qttp ,  and  these  calculations  imply  that  the  same  statement  holds  if  we 
only  assume  that  <p  is  bounded  and  continuous.  The  above  proof  however  has  the 
advantage  of  showing  the  relation  between  this  result  and  our  general  study  of 
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Markov  processes.  It  also  indicates  that  similar  results  will  hold  for  more  general 
equations  of  the  form  ^  =  Au  provided  A  can  be  interpreted  as  the  generator  of  an 
appropriate  Markov  process. 

Brownian  motion  can  be  used  to  provide  probabilistic  representations  for  solu¬ 
tions  of  many  other  parabolic  partial  differential  equations.  In  particular,  solutions 
of  equations  of  the  form 


du  1 

—  =  -  Au  —  v  u, 
3 1  2 


where  v  is  a  nonnegative  function  on  W1,  are  expressed  via  the  so-called  Feynman- 
Kac  formula:  See  Exercise  7.28  below  for  a  precise  statement. 


7.2  Brownian  Motion  and  Harmonic  Functions 

Let  us  now  turn  to  connections  between  Brownian  motion  and  the  Laplace  equation 
Au  —  0.  We  start  with  a  classical  definition. 

Definition  7.2  Let  D  be  a  domain  of  W1 .  A  function  u  :  D  — >  R  is  said  to  be 
harmonic  on  D  if  it  is  twice  continuously  differentiable  and  Au  —  0  on  D. 

Let  Df  be  a  subdomain  of  D  whose  closure  is  contained  in  D.  Consider  the 
stopping  time  T  inf{t  >  0  :  Bt  £)'}.  An  application  of  Ito’s  formula  (justified 
by  the  remark  preceding  Proposition  5.11)  shows  that,  if  u  is  harmonic  on  D ,  then 
for  every  v  G  D\  the  process 


pt  AT 

u(Bt aT)  =  u(B0)  +  /  Vu(Bs)  •  d Bs  (7.1) 

Jo 

is  a  local  martingale  under  Px  [here  and  later,  to  apply  the  stochastic  calculus  results 
of  Chap.  5,  we  let  be  the  canonical  filtration  of  B  completed  under  Px\. 

So,  roughly  speaking,  harmonic  functions  are  functions  which  when  composed 
with  Brownian  motion  give  (local)  martingales. 

Proposition  7.3  Let  u  be  harmonic  on  the  domain  D.  Let  D'  be  a  bounded 
subdomain  of  D  whose  closure  is  contained  in  D,  and  consider  the  stopping  time 
T  inf{t  >  0  :  Bt  £  D'}.  Then,  for  every  x  e  D', 

u(x)  =  Ex[u(Bt)\. 

Proof  Since  D'  is  bounded,  both  u  and  Wu  are  bounded  on  D\  and  we  also  know 
that  PX(T  <  oo)  =  1  for  every  v  G  D' .  It  follows  from  (7.1)  that  u(BtAT )  is  a  (true) 
martingale,  and  in  particular,  we  have 


u(x)  =  Ex[u(Btat)\ 
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for  every  x  e  D'.  By  letting  t  ->  oo  and  using  dominated  convergence  we  get  that 

u(x)  =  EiKflr)] 


for  every  x  e  D'.  □ 

The  preceding  proposition  easily  leads  to  the  mean  value  property  for  harmonic 
functions.  In  order  to  state  this  property,  first  recall  that  the  uniform  probability 
measure  on  the  unit  sphere,  denoted  by  0\  (dy),  is  the  unique  probability  measure  on 
{y  e  Wl  :  |y|  =  1}  that  is  invariant  under  all  vector  isometries.  For  every  v  e  W1  and 
r  >  0,  we  then  let  ax  r(dy)  be  the  image  of  0\  (dy)  under  the  mapping  y  \-^  x  -\-  ry. 

Proposition  7.4  (Mean  value  property)  Suppose  that  u  is  harmonic  on  the 
domain  D.  Then,  for  every  x  e  D  and  for  every  r  >  0  such  that  the  closed  hall 
of  radius  r  centered  at  x  is  contained  in  D,  we  have 


u 


<7*,r(dy)  u(y). 


Proof  First  observe  that,  if  T\  —  inf{t  >  0  :  \Bt\  —  1},  the  distribution  of  BTl 
under  Pq  is  invariant  under  all  vector  isometries  of  W1  (by  the  invariance  properties 
of  Brownian  motion  stated  at  the  end  of  Chap.  2)  and  therefore  this  distribution  is 
0\{dy).  By  scaling  and  translation  invariance,  it  follows  that  for  every  v  e  W1  and 
r  >  0,  if  Tx  r  —  inf{f  >  0  :  \Bt  —  x\  =  r},  the  distribution  of  BTxr  under  Px  is  ax  r. 
However,  Proposition  7.3  implies  that,  under  the  conditions  in  the  proposition,  we 
must  have  u(x)  —  Ex[u(BTxr)].  The  desired  result  follows.  □ 

We  say  that  a  (locally  bounded  and  measurable)  function  u  on  D  satisfies  the 
mean  value  property  if  the  conclusion  of  Proposition  7.4  holds.  It  turns  out  that  this 
property  characterizes  harmonic  functions. 

Lemma  7.5  Let  ube  a  locally  bounded  and  measurable  function  on  D  that  satisfies 
the  mean  value  property.  Then  u  is  harmonic  on  D. 

Proof  Fix  r0  >  0  and  let  D'  be  the  open  subset  of  D  consisting  of  all  points  whose 
distance  to  Dc  is  greater  than  r0.  It  is  enough  to  prove  that  u  is  twice  continuously 
differentiable  and  Au  —  0  on  Dr .  Let  h  :  R  — >  M+  be  a  C°°  function  with  compact 
support  contained  in  (0,  r0)  and  not  identically  zero.  Then,  for  every  x  e  D'  and 
every  re  (0,  r0),  we  have 


u 


ox, My)  u(y)- 


We  multiply  both  sides  of  this  equality  by  rd~lh(r )  and  integrate  with  respect 
to  Lebesgue  measure  dr  on  (0,ro).  Using  the  formula  for  integration  in  polar 
coordinates,  and  agreeing  for  definiteness  that  u  =  0  on  Dc ,  we  get,  for  every  x  e  D', 


c  u 


«=  f 

J{\y\<ro) 


dyh(\y\)  w(v  +  y) 


-x\)u(z), 
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where  c  >  0  is  a  constant  and  we  use  the  fact  that  h(\z  —  x|)  =  0  if  \z  —  x\  >  r q. 
Since  z  i — >  /x(|z|)  is  a  C°°  function,  the  convolution  in  the  right-hand  side  of  the  last 
display  also  defines  a  C°°  function  on  D'. 

It  remains  to  check  that  Au  —  0  on  D'.  To  this  end,  we  use  a  probabilistic 
argument  (an  analytic  argument  is  also  easy).  By  applying  Ito’s  formula  to  u(Bt) 
under  Px,  we  get,  for  v  e  D'  and  r  e  (0,  ro), 


Ex[u{BtATxr)\  =  u(x)  +  Ex  I  d s  Au(Bs) 

fo 


a 


tAT 


x,r 


If  we  let  t  — >  oo,  noting  that  £*[7^]  <  oo  (cf.  example  (b)  after  Corollary  3.24), 
we  get 


Ex[u(BTxr)\  =  u(x)  +EX  I  d  sAu(Bs) 

'o 


[£ 


XJ 


The  mean  value  property  just  says  that  Ex[u(B Txr)\  =  u(x)  and  so  we  have 


ds  Au(Bs) 


Since  this  holds  for  any  r  G  (0,  r0)  it  follows  that  Au(x)  =  0.  □ 

From  now  on,  we  assume  that  the  domain  D  is  bounded. 

Definition  7.6  (Classical  Dirichlet  problem)  Let  g  be  a  continuous  function  on 
dD.  A  function  u  :  D  — >  R  solves  the  Dirichlet  problem  in  D  with  boundary 
condition  g,  if  u  is  harmonic  on  D  and  has  boundary  condition  g ,  in  the  sense  that, 
for  every  y  e  dD,  u(x)  — >  g(y)  as  x  ^  y,  x  e  D. 

Recall  that  D  stands  for  the  closure  of  D.  If  u  solves  the  Dirichlet  problem  with 
boundary  condition  g,  the  function  u  defined  on  D  by  u(x)  —  u(x)  if  x  e  D,  and 
u(x)  —  g{x)  if  v  G  dD,  is  then  continuous,  hence  bounded  on  D. 

Proposition  7.7  Let  D  be  a  bounded  domain,  and  write  T  —  infiJ4  >  0  :  Bt  ^  D} 
for  the  exit  time  of  Brownian  motion  from  D. 

(i)  Let  g  be  a  continuous  function  on  dD,  and  let  u  be  a  solution  of  the  Dirichlet 
problem  in  D  with  boundary  condition  g.  Then,  for  every  x  G  D, 


u(x)  =  Ex[g(BT)]. 

(ii)  Let  g  be  a  bounded  measurable  function  on  dD.  Then  the  function 


u(x)  =  Ex[g(BT)],  x  e  D, 


is  harmonic  on  D. 
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Remark  Assertion  (i)  implies  that,  if  a  solution  to  the  Dirichlet  problem  with 
boundary  condition  g  exists,  then  it  is  unique,  which  is  also  easy  to  prove  using 
the  mean  value  property. 

Proof 

(i)  Fix  x  e  D  and  so  >  0  such  that  the  ball  of  radius  £o  centered  at  v  is  contained 
in  D.  For  every  s  E  (0,  £o),  let  De  be  the  connected  component  containing  v  of 
the  open  set  consisting  of  all  points  of  D  whose  distance  to  Dc  is  greater  than  s. 
If  Ts  —  inf {t  >  0  :  Bt  Ds},  Proposition  7.3  shows  that 

u(x)  =  Ex[u(BTs)\. 

Now  observe  that  Ts  \  T  as  s  ^  0  (if  T'  is  the  increasing  limit  of  Te  as  e  i  0, 
we  have  T'  <  T  and  on  the  other  hand  Bj'  E  dD  by  the  continuity  of  sample 
paths).  Using  dominated  convergence,  it  follows  that  Ex[u(BTe)\  converges  to 
Ex[g{BT)\  as  e  ->  0. 

(ii)  By  Lemma  7.5,  it  is  enough  to  verify  that  the  function  u(x )  =  Ex[g(BT)]  satisfies 
the  mean  value  property.  Recall  the  notation  Tx  r  =  inf {t  >  0  :  \Bt  —  x\  =  r } 
for  v  E  W1  and  r  >  0.  Fix  x  e  D  and  r  >  0  such  that  the  closed  ball  of  radius 
r  centered  at  v  is  contained  in  D.  We  apply  the  strong  Markov  property  in  the 
form  given  in  Theorem  6. 17:  With  the  notation  of  this  theorem,  we  let  0( w),  for 
w  e  C(R+,  W1)  such  that  w(0)  e  D ,  be  the  value  of  g  at  the  first  exit  point  of 
w  from  D  (we  take  0(w)  =  0  if  w  never  exits  D)  and  we  observe  that  we  have 

§(Bt)  =  <P(Bt,  t  >  0)  =  <P{BTxtT+u  t  >  0),  Px  a.s. 

because  the  paths  (Bt,t  >  0)  and  (Brxr+t,t  >  0)  have  the  same  exit  point 
from  D.  It  follows  that 


u(x)  =  Ex[g(BT)]  =  Ex[0(BTxr+t,  t  >  0)]  =  Ex[EBTsr  [<P(Bt,  t  >  0)]]  =  Ex[u(BTJ]. 

Since  we  know  that  the  law  of  Brsr  under  Px  is  crX)r,  this  gives  the  mean  value 
property.  □ 

Part  (i)  of  Proposition  7.7  tells  us  that  the  solution  of  the  Dirichlet  problem  with 
boundary  condition  g,  if  it  exists,  is  given  by  the  probabilistic  formula  u(x)  — 
Ex[g(BT)\.  On  the  other  hand,  for  any  choice  of  the  (bounded  measurable)  function 
g  on  dD ,  part  (ii)  tells  us  that  the  probabilistic  formula  yields  a  function  u  that  is 
harmonic  on  D.  Even  if  g  is  assumed  to  be  continuous,  it  is  however  not  clear  that 
the  function  u  has  boundary  condition  g,  and  this  need  not  be  true  in  general  (see 
Exercises  7.24  and  7.25  for  examples  where  the  Dirichlet  problem  has  no  solution). 
We  state  a  theorem  that  gives  a  partial  answer  to  this  question. 

If  y  E  dD ,  we  say  that  D  satisfies  the  exterior  cone  condition  at  y  if  there  exist 
a  (nonempty)  open  cone  ^  with  apex  y  and  a  real  r  >  0  such  that  the  intersection 
of  with  the  open  ball  of  radius  r  centered  at  y  is  contained  in  Dc .  For  instance, 
a  convex  domain  satisfies  the  exterior  cone  condition  at  every  point  of  its  boundary. 


7.2  Brownian  Motion  and  Harmonic  Functions 


191 


Theorem  7.8  (Solution  of  the  Dirichlet  problem)  Let  D  be  a  bounded  domain 
in  M.d.  Assume  that  D  satisfies  the  exterior  cone  condition  at  every  y  G  dD.  Then, 
for  every  continuous  function  g  on  dD,  the  formula 

u(x )  =  Ex[g(Bj )]  ,  where  T  —  inf {t  >  0  :  Bt  £  D), 

gives  the  unique  solution  of  the  Dirichlet  problem  with  boundary  condition  g. 

Proof  Thanks  to  Proposition  7.7  (ii),  we  only  need  to  verify  that,  for  every  fixed 
y  G  dD, 


X- 


lim  u(x)  =  g(y). 

->y,xC:D 


(7.2) 


Let  s  >  0.  Since  g  is  continuous,  we  can  find  8  >  0  such  that  we  have  |g(z)  —g(y)  |  < 
s/3  whenever  z  £  dD  and  \z  —  y\  <  8.  Let  M  >  0  be  such  that  \g(z)\  <  M  for  every 
z  e  dD.  Then,  for  every  77  >  0, 


u(x)  -  g(v)  <  Ex[\g(BT)  -  ^ty)|l{r<,,}]  +  Ex[\g(BT)  -  I l{r>r?}] 
—  Ex[\g(Bx)  ^Cy)|l{r<i)}l{sup{|B,-jc|:»<ij}<«/2}] 


+2MPX( 


sup  |  Bt  —  x 

t<rj 


+  2  MPx(T>r]). 


Write  A\ ,  A2,  A3  for  the  three  terms  in  the  sum  in  the  right-hand  side  of  the  last 
display.  We  assume  that  \y—x\  <  8/ 2,  and  we  bound  successively  these  three  terms. 
First  note  that  we  have  \BT  —  y\  <  \BT  —  x\  +  \y  —  x\  <  8  on  the  event 


{T  <q}  n  sup{ \Bf  x 


•  t<rj}<  8/2}, 


and  our  choice  of  8  ensures  that  A 1  <  s/3. 
Then,  translation  invariance  gives 


=  2 MP0(  sup  |  B 

V  t<rj 


> 


5 


which  tends  to  0  when  rj  >  0  by  the  continuity  of  sample  paths.  So  we  can  fix  q  >  0 
so  that  A2  <  s/3. 

Finally,  we  claim  that  we  can  choose  a  g  (0,5/2]  small  enough  so  that  we 
also  have  A3  =  2 M PX(T  >  q)  <  s/3  whenever  \x  —  y\  <  a.  It  follows  that 
| u(x)  —  g(y)|  <  s,  whenever  \x  —  y\  <  a,  thus  completing  the  proof  of  (7.2).  Thus  it 
only  remains  to  prove  our  claim,  which  is  the  goal  of  the  next  lemma.  □ 

Lemma  7.9  Under  the  exterior  cone  condition,  we  have  for  every  y  G  dD  and  every 

rj  >  0, 


lim  PX(T  >  rj)  =  0. 
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Proof  For  every  u  e  W1  with  |  u 
cone 


1  and  every  y  e  (0,  1),  consider  the  circular 


ff(u,  y)  {z  e  Rd  :  z  •  u  >  (1  —  y)\z\}, 

where  z  •  u  stands  for  the  usual  scalar  product.  If  y  e  dD  is  given,  the  exterior  cone 
condition  means  that  we  can  fix  r  >  0,  u  and  y  such  that 

y  +  (X( u ,  y)  n  SSj)  c  £>c, 

where  denotes  the  open  ball  of  radius  r  centered  at  0.  To  simplify  notation,  we 
set  *€  —  ^(u,y)  Pi  ^r,  and  also 

=  tf(u,  -)  n  ssrj 2 

which  is  the  intersection  of  a  smaller  cone  with  ^r/2- 

For  every  open  subset  F  of  W1 ,  write  TF  =  inf{f  >  0  :  Bt  e  F}.  An  application 
of  Blumenthal’s  zero-one  law  (Theorem  2.13,  or  rather  its  easy  extension  to 
d-dimensional  Brownian  motion)  along  the  lines  of  the  proof  of  Proposition  2.14  (i) 
shows  that  Pq{T^{u,y/2)  =  0)  =  1  and  hence  Po(T^  —  0)  =  1  by  the  continuity  of 
sample  paths.  On  the  other  hand,  set  ^  =  {z  e  c&'  :  \z\  >  cl),  for  every  a  e  (0,  r/2). 
The  sets  increase  to  c&'  as  a  1  0,  and  thus  we  have  TV  1  TV  —  0  as  a  l  0,  Pq 
a.s.  Hence,  given  any  f}  >  0  we  can  fix  a  small  enough  so  that 


Po(TK  <  i?)  >  l-p. 


Recalling  that  y  +  ^  C  fic,  we  have 


PX(T  <r])>  Px(Ty+v  <  rj) 


P()(Ty-X  +  tf  <  ^). 


However,  a  simple  geometric  argument  shows  that,  as  soon  as  \y—x\  is  small  enough, 
the  shifted  cone  y  —  x  +  ^  contains  and  therefore 


PX(T  <r])>  P0(TK  <rj)>l-p. 

Since  ft  was  arbitrary,  this  completes  the  proof.  □ 

Remark  The  exterior  cone  condition  is  only  a  sufficient  condition  for  the  existence 
(and  uniqueness)  of  a  solution  to  the  Dirichlet  problem.  See  e.g.  [69]  for  necessary 
and  sufficient  conditions  that  ensure  the  existence  of  a  solution  for  any  continuous 
boundary  value. 
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7.3  Harmonic  Functions  in  a  Ball  and  the  Poisson  Kernel 

Consider  again  a  bounded  domain  D  and  a  continuous  function  g  on  3D.  Let  T  = 
inf {t  >  0  :  Bt  £  D}  be  the  exit  time  of  D  by  Brownian  motion.  Proposition  7.7  (i) 
shows  that  the  solution  of  the  Dirichlet  problem  in  D  with  boundary  condition  g ,  if 
it  exists  (which  is  the  case  under  the  assumption  of  Theorem  7.8),  is  given  by 


u(x)  =  Ex[g(BT)\  =  /  cu(x,  civ)  g(y), 

J  dD 

where,  for  every  x  e  D,  co(x,  dy)  denotes  the  distribution  of  BT  under  Px.  The 
measure  co(x,  dy)  is  a  probability  measure  on  3D  called  the  harmonic  measure  of 
D  relative  to  v.  In  general,  it  is  hopeless  to  try  to  find  an  explicit  expression  for  the 
measures  co(x,  dy).  It  turns  out  that,  in  the  case  of  balls,  such  an  explicit  expression 
is  available  and  makes  the  representation  of  solutions  of  the  Dirichet  problem  more 
concrete. 

From  now  on,  we  suppose  that  D  =  88  \  is  the  open  unit  ball  in  Rd .  We  also 
assume  that  d  >  2  to  avoid  trivialities.  The  boundary  d88\  is  the  unit  sphere  in  W1. 

Definition  7.10  The  Poisson  kernel  (of  the  unit  ball)  is  the  function  K  defined  on 
88  \  x  3  88 \  by 


K(x,y) 


2 


for  every  x  E  88\  and  y  e  d88\ . 

Lemma  7.11  For  every  fixed  y  e  d88\,  the  function  x  ^  K(x,y)  is  harmonic 
on  88 

Proof  Set  Ky(x )  =  K(x,  y)  for  x  E  88\ .  Then  Ky  is  a  C°°  function  on  88 \ .  Moreover 
a  (somewhat  tedious)  direct  calculation  left  to  the  reader  shows  that  AKy  —  0 
on  88 \ .  □ 

In  view  of  deriving  further  properties  of  the  Poisson  kernel,  the  following  lemma 
about  radial  harmonic  functions  will  be  useful. 

Lemma  7.12  Let  0  <  r\  <  r^  he  two  real  numbers  and  let  h  :  (n,r2)  ->  R 
be  a  measurable  function.  The  function  u(x)  —  h(\x\)  is  harmonic  on  the  domain 
{x  G  W1  :  r\  <  \x\  <  r 2}  if  and  only  if  there  exist  two  constants  a  and  b  such  that 

ht  \  —  \  a  tfd  —  2, 

(  a  +  b  r2~d  if  d  >  3. 


Proof  Suppose  that  u(x)  —  h(\x\)  is  harmonic  on  {x  G  Wl  :  r\  <  \x\  <  r2}. 
Then  u  is  twice  continuously  differentiable  and  so  is  h.  From  the  expression  of  the 
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Laplacian  of  a  radial  function,  we  get  that  Au  —  0  if  and  only  if 

h!\r )  +  - — - h'(r )  =  0  ,  re  (n,  r2). 
r 

The  solutions  of  this  second  order  linear  differential  equations  are  the  functions  of 
the  form  given  in  the  statement.  The  lemma  follows.  □ 

Recall  our  notation  <7i(dy)  for  the  uniform  probability  measure  on  the  unit 
sphere  d&\ . 

Lemma  7.13  For  every  x  e  FS\, 


f  K(x,y)  cri(dy)  =  1 


Proof  For  every  x  e  ,  set 


F(x)  =  f  K(x,y)  <Ti  (dy), 

Jd£§  i 


Then  the  preceding  lemma  implies  that  F  is  harmonic  on  .  Indeed,  if  x  e  88 \  and 
r  <  1  —  |x| ,  Lemma  7.11  and  the  mean  value  property  imply  that,  for  every  y  e  , 


K(z,y)  (Jx,r(dz). 


Hence,  using  Fubini’s  theorem, 


K(z,y)  a  i  (dy)^ax,r(dz) 

K(z,  y)  aA  r(dz)  ja i  (dy)  =  j  K(x,y)  at{dy)  =  Fix) , 


showing  that  the  mean  value  property  holds  for  F. 

If  x/r  is  a  vector  isometry  of  Mrf,  we  have  K(\[s(x),  y/f(y))  =  K{x,y)  for  every 
x  e  F$\  and  y  e  dB i ,  and  the  fact  that  o\  (dy)  is  invariant  under  xfr  implies  that 
F(\jf(x))  —  F(x)  for  every  x  e  F%\.  Hence  F  is  a  radial  harmonic  function  and 
Lemma  7.12  (together  with  the  fact  that  F  is  bounded  in  the  neighborhood  of  0) 
implies  that  F  is  constant.  Since  F(0)  =  1,  the  proof  is  complete.  □ 

Theorem  7.14  Let  g  be  a  continuous  function  on  dF$\.  The  unique  solution  of  the 
Dirichlet  problem  in  with  boundary  condition  g  is  given  by 


u 


g(y)  K(x,y)  G\  (dy)  ,  x  e  . 
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Proof  The  very  same  arguments  as  in  the  beginning  of  the  proof  of  Lemma  7.13 
show  that  u  is  harmonic  on  88  \ .  To  verify  the  boundary  condition,  fix  yo  £  d8S\ . 
For  every  3  >  0,  the  explicit  form  of  the  Poisson  kernel  shows  that,  if  x  e  88\  and 
y  G  d88\  are  such  that  \x  —  yo|  <3/2  and  \y  —  yol  >3,  then 


K(x,y)<(j)d(  1 


It  follows  from  this  bound  that,  for  every  3  >  0, 


lim 


x->y0,xem  J{\y_yo\>8} 


L 


K(x,y)  <7 1  (dy)  =  0. 


(7.3) 


Then,  given  s  >  0,  we  can  choose  8  >  0  sufficiently  small  so  that  the  conditions 
y  e  3^i  and  |y-y0|  <  8  imply  |g(y)  -g(yo)|  <  e.  If  M  =  sup{|g(y)|  :  y  e  d&i}, 
it  follows  that 


/ 


«W  -  g(vo) I  =  K(pc,y )  (g(y)  -  g(y0))  ffi  (dy) 


<  2  M 


L 


K(x,y)  a  i  (dv)  +  e, 


(ly— vol><5} 


using  Lemma  7.13  in  the  first  equality,  and  then  our  choice  of  3.  Thanks  to  (7.3),  we 
now  get 


lim  sup  |  u(x)  —  g(yo)|  5  £. 

X— yXE&i 

Since  e  was  arbitrary,  this  yields  the  desired  boundary  condition.  □ 

The  preceding  theorem  allows  us  to  identify  the  harmonic  measures  of  the 
unit  ball. 

Corollary  7.15  Let  T  —  inf{t  >  0  :  Bt  88\}.  For  every  x  G  88 \y  the  distribution 
ofBj  under  Px  has  density  K(x,  y)  with  respect  to  0\  (dy). 

This  is  immediate  since,  by  combining  Proposition  7.7  (i)  with  Theorem  7.14, 
we  get  that,  for  any  continuous  function  g  on  d88\ , 

EMBt)]  =  f 

Jd.Sg\ 


g(y )  K(x, y) ffi (dy)  ,  Vi  e 
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7.4  Transience  and  Recurrence  of  Brownian  Motion 


We  consider  again  a  d-dimensional  Brownian  motion  {Bt)t> o  that  starts  from  v  under 
the  probability  measure  Px.  We  again  suppose  that  d  >  2,  since  the  corresponding 
results  for  d  —  1  have  already  been  derived  in  the  previous  chapters. 

For  every  a  >  0,  we  introduce  the  stopping  time 


Ua  =  inf{f  >  0  :  \Bt\  =  a}, 


with  the  usual  convention  inf  0  =  oo. 


Proposition  7.16  Suppose  thatx  ^  0,  and  let  s  and  R  be  such  that  0  <  s  < 
Then, 


x 


<  R. 


log^  —  log  \x\  . 


Px(Ue  <UR)={ 


log  R  —  log  £ 


p~~d  _ 


2-d 


K  d  _  £^  ^ 


if  d  —  2, 


if  d  >  3. 


(7.4) 


Consequently,  we  have  Px(Uo  <  oo)  =  0  and  for  every  s  e  (0,  |v|), 


PX(U£  <  oo)  =  < 


(l  ifd  =  2, 

£  \  d—2 


if  d  >  3, 


Proof  Write  DeR  for  the  annulus  {y  e  Rd  :  £  <  \y\  <  R}.  Let  u(x)  be  the  function 
defined  for  v  e  Ds  R  that  appears  in  the  right-hand  side  of  (7.4).  By  Lemma  7.12, 
u  is  harmonic  on  Ds  R,  and  it  is  also  clear  that  u  solves  the  Dirichlet  problem  in 
Ds  R  with  boundary  condition  g(y)  =  0  if  \y\  =  R  and  g(y)  =  1  if  \y\  =  £.  If 
Te  R  denotes  the  first  exit  time  from  Ds  R,  Proposition  7.7  shows  that  we  must  have 
u(x)  —  Ex[g(BTsR )]  for  every  v  e  Ds  R.  Formula  (7.4)  follows  since  Ex[g(BTeR)\  — 
PX(U£  <  Ur). 

If  R  ^  \x\  is  fixed,  the  event  |  1/q  ^  is  ( Px  a.s.)  contained  in  fU£  ^  UR )■,  for 
every  0  <  £  <  \x\.  By  passing  to  the  limit  £  ->  0  in  the  right-hand  side  of  (7.4), 
we  thus  get  that  Px(Uo  <  UR)  —  0.  Since  Ur  \  oo  as  R  \  oo,  it  follows  that 
PX(U0  <  oo)  =  0. 

Finally,  we  have  also  PX(U£  <  oo)  =  lim  PX(UE  <  UR)  as  R  ->  oo,  and 
by  letting  R  ->  oo  in  the  right-hand  side  of  (7.4)  we  get  the  stated  formula  for 
PX(U£  <  oo).  □ 

Remark  The  reader  will  compare  formula  (7.4)  with  the  exit  distribution  from  an 
interval  for  real  Brownian  motion  that  was  derived  in  Chap.  4  (example  (a)  after 
Corollary  3.24).  We  could  have  proved  (7.4)  in  a  way  similar  to  what  we  did  for 
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its  one-dimensional  analog,  by  applying  the  optional  stopping  theorem  to  the  local 
martingale  log  \Bt\  (if  d  —  2)  or  \Bt\2~d  (if  d  —  3).  See  Exercise  5.33. 

For  every  y  e  R^,  set  ry  —  inf{f  >  0  :  Bt  =  y},  so  that  in  particular  To  =  Uq.  The 
property  Px{z o  <  oo)  =  0  for  v  ^  0  implies  that  Px( xy  <  oo)  =  0  whenever  y  ^  x, 
by  translation  invariance.  This  means  that  the  probability  for  Brownian  motion  to 
visit  a  fixed  point  other  than  its  starting  point  is  zero:  one  says  that  points  are  polar 
for  d-dimensional  Brownian  motion  with  d  >  2  (see  Exercise  7.25  for  more  about 
polar  sets). 

If  m  denotes  Lebesgue  measure  on  Wl ,  it  follows  from  Fubini’s  theorem  that 


Ex\nk({But>  0})]  =  Ex 


<  oo)  =  0, 


and  therefore  m({Z^,  t  >  0})  =  0,  Px  a.s.  One  can  nonetheless  prove  that  the 
Hausdorff  dimension  of  the  curve  {Bt,  t  >  0}  is  equal  to  2  in  any  dimension  d  >  2 
(see  e.g.  [62]).  In  some  sense,  this  shows  that  the  planar  Brownian  curve  is  “not  so 
far”  from  having  positive  Lebesgue  measure. 

Theorem  7.17 

(i)  In  dimension  d  —  2,  Brownian  motion  is  recurrent,  meaning  that  almost  surely, 
for  every  nonempty  open  subset  O  ofW1,  the  set  {t  >  0  :  Bt  e  0}  is  unbounded. 

(ii)  In  dimension  d  >  3,  Brownian  motion  is  transient,  meaning  that 


lim 

t^oo 


Bt 


—  oo 


a.s. 


Proof 

(i)  It  is  enough  to  prove  that  the  statement  holds  when  O  is  an  open  ball  of  rational 
radius  centered  at  a  point  with  rational  coordinates.  So  it  suffices  to  consider 
a  fixed  open  ball  38  and  we  may  assume  that  38  is  centered  at  0  and  that  the 
starting  point  of  B  is  v  ^  0.  By  Proposition  7. 16  we  know  that  Brownian  motion 
will  never  hit  0  (so  that  inf{|Z?r|  :  0  <  r  <  t}  >  0  for  every  t  >  0,  a.s.)  but  still 
will  hit  any  open  ball  centered  at  0.  It  follows  that  B  must  visit  38  at  arbitrarily 
large  times,  a.s. 

(ii)  Again  we  can  assume  that  the  starting  point  of  B  is  v  ^  0.  Since  the  function 

y  \y\2~d  is  harmonic  on  MJ\{0},  and  since  we  saw  that  B  does  not  hit 
0,  we  get  that  \Bt\2~d  is  a  local  martingale  and  hence  a  supermartingale  by 
Proposition  4.7.  By  Theorem  3.19  (and  the  fact  that  a  positive  supermartingale 
is  automatically  bounded  in  L1),  we  know  that  \Bt\2~d  converges  a.s.  as  t  ->  oo. 
The  a.s.  hmit  must  be  zero  (otherwise  the  curve  :  t  >  0}  would  be  bounded!) 
and  this  says  exactly  that  \Bt\  converges  to  oo  as  t  ->  oo.  □ 

Remark  In  dimension  d  —  2,  one  can  (slightly)  reinforce  the  recurrence  property 
by  saying  that  a.s.  for  every  nonempty  open  subset  O  of  M2,  the  Lebesgue  measure 
of  {t  >  0  :  Bt  G  O)  is  infinite.  This  follows  by  a  straightforward  application  of  the 
strong  Markov  property,  and  we  omit  the  details. 
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7.5  Planar  Brownian  Motion  and  Holomorphic  Functions 

In  this  section,  we  concentrate  on  the  planar  case  d  —  2,  and  we  write  Bt  —  (Xt,  Yt) 
for  a  two-dimensional  Brownian  motion.  It  will  be  convenient  to  identify  M2  with 
the  complex  plane  C,  so  that  Bt  —  Xt  +  i Yt,  and  we  sometimes  say  that  B  is  a 
complex  Brownian  motion .  As  previously  B  starts  from  z  under  the  probability  Pz, 
for  every  z  €  C. 

If  0  :  C  — >  C  is  a  holomorphic  function,  the  real  and  imaginary  parts  of  0  are 
harmonic  functions,  and  thus  we  know  that  the  real  and  imaginary  parts  of  0(5?) 
are  continuous  local  martingales.  In  fact,  much  more  is  true. 

Theorem  7.18  Let  0  :  C  — >  C  be  a  nonconstant  holomorphic  function.  For  every 
t  >  0,  set 


ds. 


Let  z  e  C.  There  exists  a  complex  Brownian  motion  T  that  starts  from  0(z)  under 
Pz,  such  that 


< P(Bt )  =  rCt  ,  for  every  t  >  0,  Pz  a.  s. 


In  other  words,  the  image  of  complex  Brownian  motion  under  a  holomorphic 
function  is  a  time-changed  complex  Brownian  motion.  This  is  the  conformal 
invariance  property  of  planar  Brownian  motion.  It  is  possible  (and  useful  for 
many  applications)  to  extend  Theorem  7.18  to  the  case  where  0  is  defined  and 
holomorphic  in  a  domain  D  of  C  (such  that  z  £  D).  A  similar  representation  then 
holds  for  <P(Bt)  up  to  the  first  exit  time  of  D  (see  e.g.  [18]). 

Proof  Let  g  and  h  stand  respectively  for  the  real  and  imaginary  parts  of  0 .  Since  g 
and  h  are  harmonic,  an  application  of  Ito’s  formula  gives  under  Pz, 


and  similarly 


g{B,)  =  g(z)  + 


r‘  dh 

h(Bt)  =  h(z)+  /  —  (SJ)dXJ  + 

Jo  OX 


So  Mt  —  g(Bt )  and  Nt  —  h(Bt)  are  local  martingales.  Moreover,  the  Cauchy 
Riemann  equations 
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give 


(M,N)t  =  0 


and 


(M,  M)t 


0,(fiJ)|2ds  =  Ct. 


The  recurrence  of  planar  Brownian  motion  implies  that  =  ooa.s.  (take  a  ball  88 

where  |  €>'  |  is  bounded  below  by  a  positive  constant,  and  note  that  the  total  time  spent 
by  B  in  the  ball  88  is  a.s.  infinite).  We  can  then  apply  Proposition  5.15  to  Mt  —  g(z) 
and  Nt  —  h(z )  under  Pz,  and  we  find  two  independent  real  Brownian  motions  /3  and 
y  started  from  0  such  that  Mt  —  g(z)  +  J8q  and  Nt  =  h(z)  +  yct,  for  every  t  >  0, 
a.s.  The  desired  result  follows  by  setting  rt  =  <P(z)  +  fit  +  i yt.  □ 

We  will  apply  the  conformal  invariance  property  of  planar  Brownian  motion 
to  its  decomposition  in  polar  coordinates,  which  is  known  as  the  skew-product 
representation. 

Theorem  7.19  Let  z  E  C\{0}  and  write  z  —  exp(r  +  i 0)  where  r  e  R  and  0  e 
(—7 r,  7 r].  There  exist  two  independent  linear  Brownian  motions  and  y  that  start 
respectively  from  r  and  from  0  under  Pz,  such  that  we  have  Pz  a.s.  for  every  t  >  0, 


B,  =  exp(; 6h,  +  i  yh,), 


where 


Proof  The  “natural”  method  for  proving  Theorem  7.19  would  be  to  apply  a 
generalized  version  of  Theorem  7.18  to  a  suitable  determination  of  the  complex 
logarithm.  This,  however,  leads  to  some  technical  difficulties,  and  for  this  reason  we 
will  argue  differently. 

We  may  assume  that  z  —  1  (and  thus  r  —  0  —  0).  The  general  case  can  be 
reduced  to  that  one  using  scaling  and  rotational  invariance  of  Brownian  motion.  Let 
Tt  —  Ttl  +  i  Tt2  be  a  complex  Brownian  motion  started  from  0.  By  Theorem  7.18, 
we  have  a.s.  for  every  t  >  0, 


exp(rf)  =  ZCl, 


(7.5) 


where  Z  is  a  complex  Brownian  motion  started  from  1,  and  for  every  t  >  0, 
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Let  (Hs,s  >  0)  be  the  inverse  function  of  (Ct,  t  >  0),  so  that,  by  the  formula  for  the 
derivative  of  an  inverse  function, 


d  u 


using  the  fact  that  exp(L/) ) 
now  get 


in  the  last  equality.  By  (7.5)  with  t  —  Hs,  we 


Zs  =  exp  (r^  +  i  r£). 

This  is  the  desired  result  (since  L1  and  r 2  are  independent  linear  Brownian  motions 
started  from  0)  except  we  did  not  get  it  for  B  but  for  the  complex  Brownian  motion 
Z  introduced  in  the  course  of  the  argument. 

To  complete  the  proof,  we  argue  as  follows.  Write  arg  Bt  for  the  continuous 
determination  of  the  argument  of  Bt  such  that  arg  Bo  —  0  (this  makes  sense  since 
we  know  that  B  does  not  visit  0,  a.s.).  The  statement  of  Theorem  7.19  (with  z  =  1) 
is  equivalent  to  saying  that,  if  we  set 

fit  log;  |^inf{s>0:/0v  \BU\— 2du>t}  ’ 

Yt  ^inf{s>0: /0V  | Bu \~2d u>t}  ’ 


then  ft  and  y  are  two  independent  real  Brownian  motions  started  from  0.  Note  that 
P  and  y  are  deterministic  functions  of  B ,  and  so  their  law  must  be  the  same  if  we 
replace  B  by  the  complex  Brownian  motion  Z.  This  gives  the  desired  result.  □ 

Let  us  briefly  comment  on  the  skew-product  representation.  By  writing  Ht  as  the 
inverse  of  its  inverse,  we  get 


Ht  —  inf{s  >  0  :  f  exp(2/3w)  du  >  t},  (7.6) 

Jo 


and  it  follows  that 


log  |  Bt 


P 


inf{s>0: Jq  exp(2/3w)  d u>t} 


1 


showing  that  \B\  is  completely  determined  by  the  linear  Brownian  motion  /3.  This  is 
related  to  the  fact  that  \Bt\  is  a  Markov  process,  namely  a  two-dimensional  Bessel 
process  (cf.  Exercise  6.24,  and  Sect.  8.4.3  for  a  brief  discussion  of  Bessel  processes). 

On  the  other  hand,  write  6t  =  ar gBt  =  yHt.  Then  9t  is  not  a  Markov  process: 
At  least  intuitively,  this  can  be  understood  by  the  fact  that  the  past  of  0  up  to  time 
t  gives  information  on  the  current  value  of  \Bt\  (for  instance  if  6t  oscillates  very 
rapidly  just  before  t  this  indicates  that  \Bt\  should  be  small)  and  therefore  on  the 
future  evolution  of  the  process  0 . 
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In  this  section,  we  apply  the  skew-product  decomposition  to  certain  asymptotic 
results  for  planar  Brownian  motion.  We  fix  the  starting  point  z  £  C\{0}  (we  will 
often  take  z  —  1)  and  for  simplicity  we  write  P  instead  of  Pz.  We  keep  the  notation 
ft  =  ar gBt  for  a  continuous  determination  of  the  argument  of  Bt.  Although  the 
process  ft  is  not  a  Markov  process,  the  fact  that  it  can  be  written  as  a  linear  Brownian 
motion  time-changed  by  an  independent  increasing  process  allows  one  to  derive  a 
lot  of  information  about  its  path  properties.  For  instance,  since  Ht  — >  oo  as  t  ->  oo, 
we  immediately  get  from  Proposition  2.14  that,  a.s., 


lim  sup  ft  =  -foo, 


lim  inf  ft  =  — oo. 


One  may  then  ask  about  the  typical  size  of  ft  when  t  is  large.  This  is  the  celebrated 
Spitzer  theorem  on  the  winding  number  of  planar  Brownian  motion. 

Theorem  7.20  Let  ( ft  ,  t  ^  0)  be  a  continuous  determination  oj "* the  argument  oj "* the 
complex  Brownian  motion  B  started  from  z  £  C\{0}.  Then 


2 

log* 


ft 


converges  in  distribution  as  t  ->  oo  to  a  standard  symmetric  Cauchy  distribution. 
In  other  words,  for  every  x  E  R, 


/  2  „ 

lim  Pi - ft 

t^oo  Wogt 


dp 

7r(l  +  y2) ' 


Before  proving  the  theorem,  we  will  establish  a  key  lemma.  Without  loss  of 
generality,  we  may  assume  that  z  —  1  and  ft)  =  0.  We  use  the  notation  of 
Theorem  7.19,  so  that  /3  and  y  are  two  independent  linear  Brownian  motions  started 
from  0. 

Lemma  7.21  For  every  A  >  0,  consider  the  scaled  Brownian  motion  =  j-  /3^2t, 
for  every  t  >  0,  and  set  rjA)  =  inf{t  >  0  :  =  1}.  Then 


(log  tf 


Ht  -  T\ 


((log  0/2) 


■>  0 


-oo 


in  probability. 

Remark  This  shows  in  particular  that  4 (log  t)~2Ht  converges  in  distribution  to  the 
law  of  the  hitting  time  of  1  by  a  linear  Brownian  motion  started  from  0. 
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Proof  For  every  a  >  0,  set  Ta  —  inf{t  >  0  :  /3t  —  a)  and,  for  every  A  >  0, 
jiA)  =  inf{t  >  0  :  —  a}.  For  the  sake  of  simplicity,  we  write  Xt  —  (log  t)/2 

throughout  the  proof  and  always  assume  that  t  >  1 .  We  first  verify  that,  for  every 

£  >  0, 


’((A.) 


“2  H,  >  Ta 


1+e  ) 


■>  0. 


t — >oo 


(7.7) 


To  this  end,  recall  formula  (7.6),  which  shows  that 


{(A ,T2H,  >  7™}  = 


(«,  _ 


(A,)2r^» 


exp(2/6„)d u  <  t. 


=  1 A  losi 


exp(2/3M)  du  <  1>,  (7.8) 


since  2Xt  —  log  t.  From  the  change  of  variables  u  —  (Xt)2v  in  the  integral,  we  get 


1  f 

n,  ‘n 


exp(2^„)  du  =  + 

At  ZA  i 


1  f<+s  ,,  . 

—  log  /  exp(2A,/lf'))  dt>. 

A t  Jo 


We  then  note  that,  for  every  fixed  t  >  1,  the  quantity  in  the  right-hand  side  has  the 
same  distribution  as 


log  A; 

A,  +  2A 


■Tl+l 


1  f 

—  log  /  exp(2A,$„)dt> 

At  Jo 


(7.9) 


/v\ 

since  for  any  A  >  0  the  scaled  Brownian  motion  (J3t  )t>0  has  the  same  distribution 
as  (Pt)t>o-  We  then  use  the  simple  analytic  fact  stating  that,  for  any  continuous 
function/  :  M+  — »  R,  for  any  s  >  0, 

1  fs 

—  log  /  exp(2A/(u))  du  — >  sup /(r). 

2 A  J q  A— >oo  0<r<s 


We  leave  the  proof  as  an  exercise  for  the  reader.  It  follows  that 


i  rT' 
2A  108 /„ 


+£ 


exp(2X/3v)  dv  — >  sup  /3r  =  1  +  s, 

0  A— >oo  0<r<7\_|_£ 


a.s.,  and  so  the  quantity  in  (7.9)  converges  to  1  +  e,  a.s.  as  t  ->  oo.  Thus, 


Aiosi 


(A,)27-[^ 


exp(2/3M)  dw  — a  1  +  e 

/ — ^oo 
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in  probability.  Hence  the  probability  of  the  event  in  the  right-hand  side  of  (7.8) 
tends  to  0,  proving  that  (7.7)  holds.  The  very  same  arguments  show  that,  for  every 
£  G  (0,  1), 


p({X,)-2h,<t\x^  0. 


The  desired  result  now  follows,  noting  that  T^£  <  T[Xt)  <  and  that  T^  —T^l 
has  the  same  distribution  as  T\+s  —  T\-e,  which  tends  to  0  in  probability  when  e  ->  0 
(clearly,  T\-e  \  T\  as  e  ->  0,  and  on  the  other  hand  T\+e  l  7j  a.s.  as  e  ->  0,  as  a 
consequence  of  the  strong  Markov  property  at  time  7j  and  Proposition  2. 14  (i)).  □ 

Proof  of  Theorem  7.20  We  keep  the  notation  introduced  in  the  preceding  proof 
and  also  consider,  for  every  A  >  0,  the  scaled  Brownian  motion  y/A)  =  j  yyit . 
Recalling  our  notation  Xt  =  (log  t)/2  for  t  >  1,  we  have 


2 

log* 


0t  =  t-  yHt  =  y 

At 


(Ad 

(A  t)-2Ht- 


It  then  follows  from  Lemma  7.21  (using  also  the  fact  that  the  linear  Brownian 
motions  y(A)  all  have  the  same  distribution)  that 


2 

log* 


in  probability. 

(A) 

To  complete  the  proof,  we  just  have  to  notice  that,  for  every  fixed  A  >  0,  y  (A) 

has  the  standard  symmetric  Cauchy  distribution.  Indeed,  since  (/3(A) ,  ]/A))  is  a  pair 
of  independent  linear  Brownian  motions  started  from  0,  this  variable  has  the  same 
distribution  as  yTl ,  and  its  characteristic  distribution  is  computed  by  conditioning 
first  with  respect  to  T\ ,  and  then  using  the  Laplace  transform  of  T\  found  in  Example 
(c)  after  Corollary  3.24, 


E[exp(^Yrt )]  =  £[exp(-l  ^2Ti)]  =  exp(-|£|), 

which  we  recognize  as  the  characteristic  function  of  the  Cauchy  distribution.  □ 

The  skew-product  decomposition  and  Lemma  7.21  can  be  used  to  derive  other 
asymptotic  laws.  We  know  that  the  planar  Brownian  motion  B  started  from  z  ^ 
0  does  not  hit  0  a.s.,  but  on  the  other  hand  the  recurrence  property  ensures  that 
min{|Z?9|  :  0  <  s  <  t}  tends  to  0  as  t  ->  oo.  One  may  then  ask  about  the  typical  size 
of  min{|5s|  :  0  <  s  <  t}  when  t  is  large:  In  other  words,  at  which  speed  does  planar 
Brownian  motion  approach  a  point  different  from  its  starting  point? 
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Proposition  7.22  Consider  the  planar  Brownian  motion  B  started  from  z  ^  0. 
Then,  for  every  a  >  0, 


lim  p( 

t — >oo  \ 


min 

0  <s<t 


B< 


1 

14-  a 


For  instance,  the  probability  that  Brownian  motion  started  from  a  nonzero  initial 
value  comes  within  distance  l/t  from  the  origin  before  time  t  converges  to  1  /3  as 
t  ->  oo,  a  result  which  was  not  so  easy  to  guess! 

Proof  Without  loss  of  generality,  we  take  z  —  1 .  We  keep  the  notation  introduced 
in  the  proofs  of  Lemma  7.21  and  Theorem  7.20.  We  observe  that 


min 

0<s-<f 


B < 


)  =  min  6h  =  ruin  6 

/  0  <s<t  0  <s<Ht 


s  • 


It  follows  that 


2 

log/ 


mm 

0<s<t 


B< 


) 


—  min  6S  —  min 

Xt  0 <s<Ht  0<s<(Xt)~2 


By  Lemma  7.21, 


min  p[Xt)  —  min  f}^  — >  0 

o  <s<(Xt)~2  o<s<r[Xt)  t^oo 


in  probability.  We  conclude  that  we  have  the  following  convergence  in  distribution, 


- log  (  min  Bs  ^  — >  min  Bs, 

log  t  V  o<.<r  /  t-*oo  o<s<p  r 


log 

where  ft  is  a  linear  Brownian  motion  started  from  0  and  T\  —  infl.v  >0:^  =  1}. 
To  complete  the  argument,  note  that 


p(  min  ps  <  -a)  =  P(T-a  <  T\), 

\  0<s<T\  / 


if  T-a  —  inf{^  >0:fs  =  — a },  and  that  P(T~a  <  T\)  —  (1  +  a)  1  (cf.  Sect.  3.4). 

□ 

As  a  last  application  of  the  skew-product  decomposition,  we  state  the  Kallianpur- 
Robbins  asymptotic  law  for  the  time  spent  by  Brownian  motion  in  a  ball.  Here  the 
initial  value  can  be  arbitrary. 
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Theorem  7.23  Let  z  E  C  and  7?  >  0.  TTz^n,  under  Pz , 

L  i|i"j<"1  ds 

converges  in  distribution  as  t  —>  oo  to  an  exponential  distribution  with  mean  R 2. 

We  postpone  our  proof  of  Theorem  7.23  to  the  end  of  Chap.  9  since  it  relies  in 
part  on  the  theory  of  local  times  developed  in  that  chapter. 


Exercises 

In  all  exercises,  (Bt)t>o  is  a  d-dimensional  Brownian  motion  starting  from  x  under 
the  probability  measure  Px.  Except  in  Exercise  7.28,  we  always  assume  that  d  >  2. 

Exercise  7.24  Let  38  \  be  the  open  unit  ball  of  W i  (d  >  2),  and  38*  =  38\\{0}.  Let 
g  be  the  continuous  function  defined  on  d38*  by  g(v)  =  0  if  \x\  —  1  and  g(0)  =  1. 
Prove  that  the  Dirichlet  problem  in  38*  with  boundary  condition  g  has  no  solution. 

Exercise  7.25  (Polar  sets)  Throughout  this  exercise,  we  consider  a  nonempty 
compact  subset  K  of  Rd  (d  >  2).  We  set  TK  —  inf{t  >  0  :  Bt  e  K}.  We  say 
that  K  is  polar  if  there  exists  mix  e  Kc  such  that  Px(Tk  <  oo)  =  0. 

1.  Using  the  strong  Markov  property  as  in  the  proof  of  Proposition  7.7  (ii),  prove 
that  the  function  v  i->  Px(Tk  <  oo)  is  harmonic  on  every  connected  component 
of  Kc. 

2.  From  now  on  until  question  4.,  we  assume  that  K  is  polar.  Prove  that  Kc  is 
connected,  and  that  the  property  PX(TK  <  oo)  =  0  holds  for  every  x  e  Kc. 
(Hint:  Observe  that  {x  G  Kc  :  Px(Tk  <  oo)  =  0}  is  both  open  and  closed). 

3.  Let  D  be  a  bounded  domain  containing  K ,  and  D'  —  D\K.  Prove  that  any 
bounded  harmonic  function  h  on  Dr  can  be  extended  to  a  harmonic  function  on 
D.  Does  this  remain  true  if  the  word  “bounded”  is  replaced  by  “positive”? 

4.  Set  g(v)  =  0  if  v  G  dD  and  g(v)  =  1  if  v  G  dD'\dD.  Prove  that  the 
Dirichlet  problem  in  D'  with  boundary  condition  g  has  no  solution.  (Note  that 
this  generalizes  the  result  of  Exercise  7.24.) 

5.  If  a  g  (0,  d\,  we  say  that  the  compact  set  K  has  zero  a -dimensional  Hausdorff 
measure  if,  for  every  e  >  0,  we  can  find  an  integer  Ns  >  1  and  Ns  open  balls 
7^(1), . . . ,  3B{Ne)  with  respective  radii  r(p, . . .  rpve),  such  that  K  is  contained  in  the 
union  38 ^  U  •  •  •  U  38(ns)>  an(l 


Ns 


7=1 


<  £. 


Prove  that  if  d  >  3  and  K  has  zero  d  —  2-dimensional  Hausdorff  measure  then  K 
is  polar. 
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Exercise  7.26  In  this  exercise,  d  >  3.  Let  K  be  a  compact  subset  of  the  open  unit 
ball  of  Rd,  and  TK  inf {t  >  0  :  Bt  e  K}.  We  assume  that  D  Wl\K  is  connected. 
We  also  consider  a  function  g  defined  and  continuous  on  K.  The  goal  of  the  exercise 
is  to  determine  all  functions  u  :  D  ->  R  that  satisfy: 

(P)  u  is  bounded  and  continuous  on  Z),  harmonic  on  D ,  and  u(y)  =  g(y)  if  y  e  dD. 

(This  is  the  Dirichlet  problem  in  D,  but  in  contrast  with  Sect.  7.3  above,  D  is 
unbounded  here.)  We  fix  an  increasing  sequence  (Rn)n>  t  of  reals,  with  R\  >  1 
and  Rn  \  oo  as  n  ->  oo.  For  every  n  >  1,  we  set  7(n)  :  =  inf{t  >  0  :  \Bt\  > 

1.  Suppose  that  m  satisfies  (P).  Prove  that,  for  every  n  >  1  and  every  x  e  D  such 
that  |v|  < 


t/(v)  —  £x[g(Z?rA,)  l{7v<r(n)}]  +  Fjw(Z?r(/7))  l{T(n)<TK}\- 

2.  Show  that,  by  replacing  the  sequence  {Rn)n>  t  with  a  subsequence  if  necessary, 
we  may  assume  that  there  exists  a  constant  such  that,  for  every  x  e  D, 


and  that  we  then  have 


lim  u(x)  —  a. 

%|  — >oo 


3.  Show  that,  for  every  x  e  D, 


u(x)  =  Ex[g(BTK)  1{7^<oo}]  +  0!  Px(Tk  =  oo). 


4.  Assume  that  D  satisfies  the  exterior  cone  condition  at  every  y  e  dD  (this  is 
defined  in  the  same  way  as  when  D  is  bounded).  Show  that,  for  any  choice  of 
a  e  M,  the  formula  of  question  3.  gives  a  solution  of  the  problem  (P). 

Exercise  7.27  Let /  :  C  C  be  a  nonconstant  holomorphic  function.  Use  planar 
Brownian  motion  to  prove  that  the  set  {fig)  :  z  G  C}  is  dense  in  C.  (Much  more  is 
true,  since  Picard’s  little  theorem  asserts  that  the  complement  of  {f(z)  :  z  G  C}  in  C 
contains  at  most  one  point:  This  can  also  be  proved  using  Brownian  motion,  but  the 
argument  is  more  involved,  see  [12].) 

Exercise  7.28  ( Feynman-Kac  formula  for  Brownian  motion)  This  is  a  continua¬ 
tion  of  Exercise  6.26  in  Chap.  6.  With  the  notation  of  this  exercise,  we  assume  that 
E  —  Rd  and  Xt  —  Bt.  Let  v  be  a  nonnegative  function  in  Co(MJ),  and  assume  that 
v  is  continuously  differentiable  with  bounded  first  derivatives.  As  in  Exercise  6.26, 
set,  for  every  <p  e  B(Rd ), 


Q*(p(x)  =  Ex  <p(X,)  exp  (  -  jf  v(Xs)  d.vj  . 
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1.  Using  the  formula  derived  in  question  2.  of  Exercise  6.26,  prove  that,  for 
every  t  >  0,  and  every  ip  G  Co(Rd),  the  function  Q* ip  is  twice  continuously 
differentiable  on  W \  and  that  Q*  cp  and  its  partial  derivatives  up  to  order  2  belong 
to  Co(MJ).  Conclude  that  Q*cp  e  D(L). 

2.  Let  ip  G  Co(MJ)  and  set  ut(x )  =  Q*<p(x)  for  every  t  >  0  and  v  G  Wl.  Using 
question  3.  of  Exercise  6.26,  prove  that,  for  every  v  G  M.d,  the  function  1 1->  ut(x) 
is  continuously  differentiable  on  (0,  oo),  and 


dut 

dt 


1 

-Aut  —  v  ut 
2' 


Exercise  7.29  In  this  exercise  d  —  2  and  M2  is  identified  with  the  complex  plane 
C.  Let  a  g  (0,  tt),  and  consider  the  open  cone 

^  =  {re10:r>O,0G  (—a,  a)}. 


Set  T:=  M{t  >  0  :  Bt  £  %}. 

1.  Show  that  the  law  of  log  \BT\  under  P\  is  the  law  of  /Jinf{f>o:|y,|=a}>  where  /3  and 
y  are  two  independent  linear  Brownian  motions  started  from  0. 

2.  Verify  that,  for  every  A  G  R, 

r,  [piAloglfirh  _  1 

cosh(ofA) 


Notes  and  Comments 

Connections  between  Brownian  motion  and  partial  differential  equations  have  been 
known  for  a  long  time  and  motivated  the  study  of  this  random  process.  A  survey  of 
the  partial  differential  equations  that  can  be  solved  in  terms  of  Brownian  motion  can 
be  found  in  the  book  of  Durrett  [18,  Chapter  8].  The  representation  of  Theorem  7.1 
(written  in  terms  of  the  Gaussian  density)  goes  back  to  the  ninetieth  century  and  the 
work  of  Fourier  and  Laplace  -  see  the  references  in  [49] .  The  beautiful  relations 
between  Brownian  motion  and  harmonic  functions  were  discovered  and  studied 
by  Kakutani  [45,  46],  and  Hunt  [33,  34]  later  studied  the  connections  between 
potential  theory  and  transient  Markov  processes  (see  the  Blumenthal-Getoor  book 
[5]  for  more  on  this  topic).  Nice  accounts  of  the  links  between  Brownian  motion 
and  classical  potential  theory  can  be  found  in  the  books  by  Port  and  Stone  [69] 
and  Doob  [16]  (see  also  Ito  and  McKean  [42],  Chung  [9],  and  Chapters  3  and  8  of 
[62]).  The  conformal  invariance  of  planar  Brownian  motion  was  stated  by  Levy  [54] 
with  a  very  sketchy  proof.  Davis’  paper  [12]  is  a  nice  survey  of  relations  between 
planar  Brownian  motion  and  analytic  functions,  see  also  Durrett’s  book  [18],  and  the 
paper  [28]  by  Getoor  and  Sharpe  for  a  notion  of  conformal  martingale  that  plays  in 
martingale  theory  a  role  similar  to  that  of  analytic  functions.  Spitzer’s  theorem  was 
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obtained  in  the  classical  paper  [75],  and  the  Kallianpur-Robbins  law  was  derived  in 
[48].  A  number  of  remarkable  properties  of  planar  Brownian  motion  had  already 
been  observed  by  Levy  [53]  in  1940.  We  refer  to  Pitman  and  Yor  [68]  for  a 
systematic  study  of  asymptotic  laws  of  planar  Brownian  motion.  Our  presentation 
closely  follows  [52],  where  other  applications  of  the  skew-product  decomposition 
can  be  found. 


Chapter  8 

Stochastic  Differential  Equations 


This  chapter  is  devoted  to  stochastic  differential  equations,  which  motivated  Ito’s 
construction  of  stochastic  integrals.  After  giving  the  general  definitions,  we  provide 
a  detailed  treatment  of  the  Lipschitz  case,  where  strong  existence  and  uniqueness 
statements  hold.  Still  in  the  Lipschitz  case,  we  show  that  the  solution  of  a  stochastic 
differential  equation  is  a  Markov  process  with  a  Feller  semigroup,  whose  generator 
is  a  second-order  differential  operator.  By  results  of  Chap.  6,  the  Feller  property 
immediately  gives  the  strong  Markov  property  of  solutions  of  stochastic  differential 
equations.  The  last  section  presents  a  few  important  examples.  This  chapter  can  be 
read  independently  of  Chap.  7. 


8.1  Motivation  and  General  Definitions 

The  goal  of  stochastic  differential  equations  is  to  provide  a  model  for  a  differential 
equation  perturbed  by  a  random  noise.  Consider  an  ordinary  differential  equation  of 
the  form 


y'(t)  =  b(y(t)), 


or,  in  differential  form, 


dyt  =  b(yt)  dt. 


Such  an  equation  is  used  to  model  the  evolution  of  a  physical  system.  If  we  take 
random  perturbations  of  the  system  into  account,  we  add  a  noise  term,  which  is 
typically  of  the  form  a  d Bt,  where  B  denotes  a  Brownian  motion,  and  a  is  a  constant 
corresponding  to  the  intensity  of  the  noise.  Note  that  the  use  of  Brownian  motion 
here  is  justified  by  its  property  of  independence  of  increments,  corresponding  to  the 
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fact  that  the  random  perturbations  affecting  disjoint  time  intervals  are  assumed  to 
be  independent. 

In  this  way,  we  arrive  at  a  stochastic  differential  equation  of  the  form 

dy,  =  b(yt)  dt  +  a  d Bt, 

or  in  integral  form,  the  only  one  with  a  rigorous  mathematical  meaning, 

yt  -  yo  +  /  b(ys)  ds  +  a  Bt. 

Jo 

We  generalize  the  preceding  equation  by  allowing  a  to  depend  on  the  state  of  the 
system  at  time  t : 


d yt  =  b(yt)  dt  +  a(y,)  d B„ 


or,  in  integral  form, 


ds+  a(ys)dBs. 

Jo 

Because  of  the  integral  in  d Bs,  the  preceding  equation  only  makes  sense  thanks  to 
the  theory  of  stochastic  integrals  developed  in  Chap.  5.  We  can  still  generalize  the 
preceding  equation  by  allowing  a  and  b  to  depend  on  the  time  parameter  t.  This 
leads  to  the  following  definition. 

Definition  8.1  Let  d  and  m  be  positive  integers,  and  let  a  and  b  be  locally  bounded 
measurable  functions  defined  on  M+  x  Rd  and  taking  values  in  MjXm  (M)  and  in  W1 
respectively,  where  is  the  set  of  all  d  x  m  matrices  with  real  coefficients. 

We  write  a  =  ((Jij)\<i<d,i<j<m  and  b  =  0/)i  </<</. 

A  solution  of  the  stochastic  differential  equation 

E(<j,  b )  dXt  =  cr(t,  Xt)  d Bt  +  b(t,  Xt)  dt 

consists  of: 

•  a  filtered  probability  space  (X2,  (^t)te[o,oo\,  P)  (where  the  filtration  is  always 

assumed  to  be  complete); 

•  an  ra-dimensional  Brownian  motion  B  —  (Bl , ,  Bm )  started  from  0; 

•  an  (J^i)-adapted  process  X  =  ( X \  . . .  ,Xd )  with  values  in  M.d,  with  continuous 
sample  paths,  such  that 
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meaning  that,  for  every  i  e  { 1 , ,d}, 

m  pt  pt 

Xlt=Xl0  +  Y  I  +  /  bi(s,Xs)  ds\ 

■=1  Jo  Jo 

If  additionally  Xo  =  x  e  Rd,  we  say  that  X  is  a  solution  of  Ex(<j,  b). 

Note  that,  when  we  speak  about  a  solution  of  E(a,  b ),  we  do  not  fix  a  priori  the 
filtered  probability  space  and  the  Brownian  motion  B.  When  we  fix  these  objects, 
we  will  say  so  explicitly. 

There  are  several  notions  of  existence  and  uniqueness  for  stochastic  differential 
equations. 

Definition  8.2  For  the  equation  E(a,  b)  we  say  that  there  is 

•  weak  existence  if,  for  every  x  eWl,  there  exists  a  solution  of  Ex(p ,  b)\ 

•  weak  existence  and  weak  uniqueness  if  in  addition,  for  every  x  eW1,  all  solutions 
of  Ex(a,  b)  have  the  same  law; 

•  pathwise  uniqueness  if,  whenever  the  filtered  probability  space  (f2,  P) 

and  the  (J^) -Brownian  motion  B  are  fixed,  two  solutions  X  and  X'  such  that 
Xo  =  Xq  a.s.  are  indistinguishable. 

Furthermore,  we  say  that  a  solution  X  of  Ex(a,  b)  is  a  strong  solution  if  X  is  adapted 
with  respect  to  the  completed  canonical  filtration  of  B. 

Remark  It  may  happen  that  weak  existence  and  weak  uniqueness  hold  but  pathwise 
uniqueness  fails.  For  a  simple  example,  consider  a  real  Brownian  motion  /3  started 
from  /3q  —  y,  and  set 


B,=  f  sgn  (/3V)  d/3(, 

Jo 

where  sgn  (v)  =  1  if  v  >  0  and  sgn  (v)  =  —  1  if  x  <  0.  Then,  one  immediately  gets 
from  the  “associativity”  of  stochastic  integrals  that 


ft  =y  + 


Moreover,  B  is  a  continuous  martingale  with  quadratic  variation  ( B,B)t  =  t ,  and 
Theorem  5.12  shows  that  B  is  a  Brownian  motion  started  from  0.  We  thus  see  that  ft 
solves  the  stochastic  differential  equation 


dX,  =  sgn  (Xt)  d Bt,  X0  =  y, 


and  it  follows  that  weak  existence  holds  for  this  equation.  Theorem  5. 12  again  shows 
that  any  other  solution  of  this  equation  must  be  a  Brownian  motion,  which  gives 
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weak  uniqueness.  On  the  other  hand,  path  wise  uniqueness  fails.  In  fact,  taking  y  —  0 
in  the  construction,  one  easily  sees  that  both  /3  and  —ft  solve  the  preceding  stochastic 
differential  equation  with  the  same  Brownian  motion  B  and  initial  value  0  (note  that 
/q  1{^=o}  ds  —  0,  which  implies  /Qr  l{^s=o}  d Bs  —  0).  One  can  also  show  that  is 
not  a  strong  solution:  One  verifies  that  the  canonical  filtration  of  B  coincides  with 
the  canonical  filtration  of  |/3|,  which  is  strictly  smaller  than  that  of  /3  (we  omit  the 
proof,  which  is  a  simple  application  of  formula  (9.18)  in  Chap.  9). 

The  next  theorem  links  the  different  notions  of  existence  and  uniqueness. 

Theorem  (Yamada-Watanabe)  If  both  weak  existence  and  pathwise  uniqueness 
hold,  then  weak  uniqueness  also  holds.  Moreover,  for  any  choice  of  the  filtered 
probability  space  (k2,  ,  (#i),  P)  and  of  the  (J^i)  -Brownian  motion  B,  there  exists 

for  every  x  e  a  ( unique )  strong  solution  ofEx(a,  b). 

We  omit  the  proof  (see  Yamada  and  Watanabe  [83])  because  we  will  not  need 
this  result.  In  the  Lipschitz  case  that  we  will  consider,  we  will  establish  directly  the 
properties  given  by  the  Yamada-Watanabe  theorem. 


8.2  The  Lipschitz  Case 


In  this  section,  we  work  under  the  following  assumptions. 

Assumptions  The  functions  o  are  b  are  continuous  on  R+  x  W1  and  Lipschitz  in 
the  variable  x:  There  exists  a  constant  K  such  that,  for  every  t  >  0,  x,  y  e  W1, 


I cr(t,x)  -o(t,y)\  <  K\x-y\, 
I b(t,x)  ~b(t,y) |  <K\x-y\. 


Theorem  8.3  Under  the  preceding  assumptions,  pathwise  uniqueness  holds  for 
E(o,  b),  and,  for  every  choice  of  the  filtered  probability  space  (£?,  ( J^i) ,  P)  and 
of  the  (^t) -Brownian  motion  B,  for  every  x  e  M.d,  there  exists  a  ( unique )  strong 
solution  of  Ex(p,  b). 

The  theorem  implies  in  particular  that  weak  existence  holds  for  E(a,  b).  Weak 
uniqueness  will  follow  from  the  next  theorem  (it  can  also  be  deduced  from  pathwise 
uniqueness  using  the  Yamada-Watanabe  theorem). 

Remark  One  can  “localize”  the  Lipschitz  assumption  on  a  and  b ,  meaning  that 
the  constant  K  may  depend  on  the  compact  set  on  which  the  parameters  t  and  x,  y 
are  considered.  In  that  case,  it  is,  however,  necessary  to  keep  a  condition  of  linear 
growth  of  the  form 

\o(t,x)\  <  K(  1  +  |*|),  \b(t,x)\  <  K(  1  +  |*|). 
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This  kind  of  condition,  which  avoids  the  blow-up  of  solutions,  already  appears  in 
ordinary  differential  equations. 

Proof  For  the  sake  of  simplicity,  we  consider  only  the  case  d  —  m  —  1 .  The  reader 
will  be  able  to  check  that  the  general  case  follows  from  exactly  the  same  arguments, 
at  the  cost  of  a  heavier  notation.  Let  us  start  by  proving  path  wise  uniqueness.  We 
consider  (on  the  same  filtered  probability  space,  with  the  same  Brownian  motion  B) 
two  solutions  X  and  X'  such  that  Xq  —  X'0.  Fix  M  >  0  and  set 

r  =  inf{t  >  0  :  \Xt\  >  M  or  \X[\  >  M}. 


Then,  for  every  t  >  0, 


=  *o  + 


<r(s,Xs)  d Bs  + 


ds 


and  an  analogous  equation  holds  for  X'tAz.  Fix  a  constant  T  >  0.  By  considering 
the  difference  between  the  two  equations  and  using  the  bound  (5.14),  we  get,  for 
te  [0,71, 


£[(X,Ar  -  VAt)2] 


<  2  E 


<  2 


f  (a(s,Xs)-a(s,X's))dB^ 

|2]+2£[( 

nt/\X 

J  (a(s,Xs)-a(s,X's))2ds 

+  TE 

u. 

't  Ar 


Jo 

t/\T 


(b(s,  Xs)  -  b(s,  X'))2d s 


r  WAr 

<  2K2(1  +  T)  J  (Xs  -  V)2dvJ 

<  2K2(1  +  T)E[[  (XS/,T  —  VAT)2d.v] 


Hence  the  function  h(t)  —  E[(XtAz  —  XtAz)z]  satisfies 


h(t)  <  C  [  h(s )  ds 

Jo 


for  every  t  G  [0,  T\,  with  C  =  2K2{\  +  T ). 

Lemma  8.4  (Gronwall’s  lemma)  Let  T  >  0  and  let  g  be  a  nonnegative  bounded 
measurable  function  on  [0,  T\.  Assume  that  there  exist  two  constants  a  >  0  and 
b  >  0  such  that,  for  every  t  G  [0,  T\, 


g(t)  <a  +  bf  g(s)  ds. 

Jo 
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Then,  we  also  have,  for  every  t  G  [0,  T], 

git)  <  a  exp  (ht). 


Proof  of  the  lemma  By  iterating  the  condition  on  g,  we  get, 


g(t)  <  a  +  a(bt )  +  bz 


/  dsf 


drg(r) 


<  a  +  a(bt)  +  a^-J-  H - h  cJ—L  +  bn+l 


n\ 


rt  nS  1  nSn 

/  d si  d s2  -j  ds„+ig(s„+i), 

Jo  Jo  Jo 


for  every  n  >  1.  If  A  is  a  constant  such  that  0  <  g  <  A,  the  last  term  in  the  right- 
hand  side  is  bounded  above  by  A(bt)n+l / (n  +1)!,  hence  tends  to  0  as  n  — >  oo.  The 
desired  result  now  follows.  □ 

Let  us  return  to  the  proof  of  the  theorem.  The  function  h  is  bounded  above  by 
4 M2  and  the  assumption  of  the  lemma  holds  with  a  =  0,  b  —  C.  We  thus  get  h  —  0, 
so  that  XtAz  =  X'tAr.  By  letting  M  tend  to  oo,  we  get  Xt  —  X',  which  completes  the 
proof  of  path  wise  uniqueness. 

For  the  second  assertion,  we  construct  a  solution  using  Picard’s  approximation 
method.  We  define  by  induction 


$  =  ^ 
x)  —  X  + 

X?  =  x  + 


i 

f 


c j(s ,  v)  d Bs  +  /  h(s,  v)  d^, 


l 


o(s,X"-])dBs  +  /  b(s.X 


L 


m— 1 


The  stochastic  integrals  are  well  defined  since  one  verifies  by  induction  that,  for 
every  n ,  the  process  Xn  is  adapted  and  has  continuous  sample  paths. 

It  is  enough  to  show  that,  for  every  T  >  0,  there  is  a  strong  solution  of  Ex(o,  b ) 
on  the  time  interval  [0,  T\.  Indeed,  the  uniqueness  part  of  the  argument  will  then 
allow  us  to  get  a  (unique)  strong  solution  on  M+  that  will  coincide  with  the  solution 
on  [0,  T\  up  to  time  T. 

We  fix  T  >  0  and,  for  every  n  >  1  and  every  t  G  [0,  T],  we  set 


gn(t)  =  E 


sup  \Xns  —  X 

L  CX.v</ 


n— 1  \2 

s 


We  will  bound  the  functions  gn  by  induction  on  n  (at  present,  it  is  not  yet  clear 
that  these  functions  are  finite).  The  fact  that  the  functions  a(-,v)  and  b(-,x )  are 
continuous,  hence  bounded,  over  [0,  T]  implies  that  there  exists  a  constant  CfT  such 
that  gi(f)  <  C'T  for  every  t  G  [0,  T\  (use  Doob’s  inequality  in  L2  for  the  stochastic 
integral  term). 
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Then  we  observe  that 


x?+1-x?=  I  (o(s,Xn\-o(s,XT'))dBs+  I  (b(s,x?)-b(s,xrl))ds 


-f 


rfl—l 


/ 


ril —  1 ' 


Hence,  using  the  case  p  —  2  of  the  Burkholder-Davis-Gundy  inequalities  in  the 
second  bound  (and  writing  Cp)  for  the  constant  in  this  inequality), 


sup  |x"+1  -  x>; 


L  0<s<t 


<  2  E 


sup 

L  0<s<t 


L 


(o(u,X:)-o(u,Xrl))dBu 


+  sup 

0  <s<t 


[ 


(b(u,Xnu)-b(u,Xn-0)du 


2-i 


<  2  (c(2)E[f(G(u,Xl)  -  G(u,Xn~x))2  du 


+TE[  J,(b(u,Xnu)-b(u,Xn-l))2duj) 


<  2(C(2)  +  T)KZE  IV  -  K 


[J 


rn- 1 12 


d  u 


<  CtE 


[f 


sup  |X”  —  X”  !|2dw 


0  0<r<u 


where  =  2(C(2)  +  T)X2.  We  have  thus  obtained  that,  for  every  n  >  1 


gn+l(0  <CT  gn(u)du. 

Jo 


(8.1) 


Recalling  that  gi(t)  <  CrT ,  an  induction  argument  using  (8.1)  shows  that,  for  every 
n  >  1  and  £  e  [0,  T], 


s«(0  <  Crr(cr) 


/.ft— 1 


ft— 1 


(«  -  1)! 


In  particular,  ,  ft,  (7') 1/2  <  oo,  which  implies  that 


E 


sup  |X?+1-V 

0<t<T 


<  OO, 


a.s. 


Hence  the  sequence  (X",  0  <  t  <  T)  converges  uniformly  on  [0,  T],  a.s.,  to  a  limiting 
process  (Xt,0  <  t  <  T),  which  has  continuous  sample  paths.  By  induction,  one  also 
verifies  that,  for  every  n ,  Xn  is  adapted  with  respect  to  the  (completed)  canonical 
filtration  of  B ,  and  the  same  holds  for  X. 
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Finally,  from  the  fact  that  a  and  b  are  Lipschitz  in  the  variable  x,  we  also  get  that, 
for  every  t  g  [0,  T], 


lim 

n—>o o 


lim 

n—>oo 


a(s,Xs)dBi 


b(s,Xs )  ds 


in  probability  (to  deal  with  the  stochastic  integrals,  we  may  use  Proposition  5.8, 
noting  that  \Xns  —  Xs\  is  dominated  by  sup0<r<5  l^f+1  —  X*|).  By  Passing  to 

the  limit  in  the  induction  equation  defining  Xn ,  we  get  that  X  solves  Ex(o,  b)  on 
[0,  T].  This  completes  the  proof  of  the  theorem.  □ 

In  the  following  statement,  W(dw)  stands  for  the  Wiener  measure  on  the 
canonical  space  C(R+,  Rm)  of  all  continuous  functions  from  M+  into  Rm  (W'(dw) 
is  the  law  of  (Bt,  t  >  0)  if  B  is  an  ra-dimensional  Brownian  motion  started  from  0). 

Theorem  8.5  Under  the  assumptions  of  the  preceding  theorem,  there  exists,  for 
every  x  G  R,  a  mapping Fx  :  C(R+,Rm)  — »  C(M+,MJ),  which  is  measurable  when 
C(R+,  Wn )  is  equipped  with  the  Borel  o -field  completed  by  the  W -negligible  sets, 
and  C(R+,  MJ)  is  equipped  with  the  Borel  a -field,  such  that  the  following  properties 
hold: 

(i)  for  every  t  >  0,  Fx{ w)r  coincides  W(dw)  a.s.  with  a  measurable  function  of 
(w(r),  0  <  r  <  t); 

(ii)  for  every  w  G  C(R+,  MJn),  the  mapping  x  ^  Fx( w)  is  continuous; 

(iii)  for  every  x  G  W1,  for  every  choice  of  the  ( complete )  filtered  probability 
space  (£2,^,  (^t),P)  and  of  the  m-dimensional  (2^)- -Brownian  motion  B 
with  Bo  —  0,  the  process  Xt  defined  Xt  —  Fx(B)t  is  the  unique  solution  of 
Ex(o ,  b);  furthermore,  if  U  is  an  & o-measurable  real  random  variable,  the 
process  Ejj(B)t  is  the  unique  solution  with  Xo  =  U. 

Remark  Assertion  (iii)  implies  in  particular  that  weak  uniqueness  holds  for 
E((j,b):  any  solution  of  Ex(o,b )  must  be  of  the  form  EX(B)  and  its  law  is  thus 
uniquely  determined  as  the  image  of  W(dw)  under  Fx. 

Proof  Again  we  consider  only  the  case  d  —  m  —  1.  Let  be  the  class  of  all 

W-negligible  sets  in  C(R+,  R),  and,  for  every  t  G  [0,  oo],  set 


%  —  a(w(s),  0  <  s  <  t )  v  . 


For  every  v  G  R,  we  write  Xx  for  the  solution  of  Ex(cr,  b )  corresponding  to  the 
filtered  probability  space  (C(R+,  R),  ^oo,  &t),W)  and  the  (canonical)  Brownian 
motion  Z^(w)  =  w (t).  This  solution  exists  and  is  unique  (up  to  indistinguishability) 
by  Theorem  8.3,  noting  that  the  filtration  (%)  is  complete  by  construction. 
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Let  xjgM  and  let  Tn  be  the  stopping  time  defined  by 

Tn  —  inf{£  >  0  :  \X*\  >  n  or  \Xyt  |  >  n). 

Let  p  >  2  and  T  >  1.  Using  the  Burkholder-Davis-Gundy  inequalities  (Theo¬ 
rem  5.16)  and  then  the  Holder  inequality,  we  get,  for  t  G  [0,  T], 


sup  |X*a7-  -  XjA7. 


L  £<f 


<Cp(\x-y\p+E  sup  /  (cr(r,Xx)  -  o(r,Xyr))dB 

v  L  ,v<r  jo 


f 


+E 


sup 

L  s<t 


n  s  AT n  p 

J  ( b(r ,  Xxr)  -  b(r ,  Xy))  dr  j 


ptf\T„  /  2-. 

<  Cp  (|x  -  y\P  +  CpE[(  I  (<r(r,  X,!)  -  a(r,  X>’))2drJ 


+£ 


tAT„ 


\b(r,Xx)  ~  b(,%X^)\dryj) 


<  CP(|x  -  y|*  +  jf  |a(r  A  T^X^)  -  a(r  A  Tn,XyrATn)\pdr 

+f~l  E[J^  \b(r  A  Tn,XxrATn) -b(r  A  r„,X?Al.  )P’dr]) 
<C;(|x-#  +  r  f  £[|X* 


—  X3'  H  dr^l 

fATn  ArArJ  JUry’ 


where  the  constants  Cp,  C^,,  C"  <  oo  depend  on  p  (and  on  the  constant  AT  appearing 
in  our  assumption  on  a  and  Z?)  but  not  on  n  or  on  x,  y  and  T. 


As  the  function  t  \-+  E 
that,  for  £  G  [0,  T\, 


sup s<,  |X;a7,  -  XjA7.n 


is  bounded,  Lemma  8.4  implies 


sup  \X, 

L  s<t 


X 


SAT,, 


yy 

^sATn 


p 

<  c'' 

—  p 

<  c:\x-y\p  exp (c;r?), 


hence,  letting  ^  tend  to  oo, 


L  5<? 


sup  |X;  -  XJ  <  C" |x  -  y r  exp(C;  7A). 


The  topology  on  the  space  C(R+,  R)  is  defined  by  the  distance 


oo 


d(w,  w')  =  V'ah  sup | w (s)  -  v/(s)|  a  1), 
k=  i  7 
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where  the  sequence  of  positive  reals  oik  can  be  chosen  in  an  arbitrary  way,  provided 
that  the  series  ^  oik  converges.  We  may  choose  the  coefficients  a k  so  that 

oo 

exp(C"F+1)  <  oo. 

k=  1 


For  every  tel,  we  consider  Xx  as  a  random  variable  with  values  in  C(M+ ,  R).  The 
preceding  estimates  and  Jensen’s  inequality  then  show  that 


x-y\ 


P 


with  a  constant  Cp  independent  of  v  and  y.  By  Kolmogorov’s  lemma  (Theorem  2.9), 
applied  to  the  process  ( Xx ,  teR)  with  values  in  the  space  E  —  C(R+,  R)  equipped 
with  the  distance  d,  we  get  that  (Xx,  x  e  R)  has  a  modification  with  continuous 
sample  paths,  which  we  denote  by  (Xx,x  e  R).  We  set  Fx{ w)  =  Xx(w)  = 
(Xf(w))t>o-  Property  (ii)  is  then  obvious. 

The  mapping  w  i->  Fx( w)  is  measurable  from  C(R+,  R)  equipped  with  the  a- 
field  into  C(R+,R)  equipped  with  the  Borel  a-field  —  o(yj(s),s  >  0). 
Moreover,  for  every  t  >  0,  Fx( w)t  —  Xx(w)  =  X*(w)  is  ^-measurable  hence 

coincides  W(dw)  a.s.  with  a  measurable  function  of  (w(s),0  <  s  <  t).  Thus 
property  (i)  holds. 

Let  us  now  prove  the  first  part  of  assertion  (iii).  To  this  end,  we  fix  the  filtered 
probability  space  (f2,  ,  P)  and  the  (J^)-Brownian  motion  B.  We  need  to 

verify  that  the  process  (Fx(B)t)t>o  then  solves  Ex(a ,  b).  This  process  (trivially)  has 
continuous  sample  paths,  and  is  also  adapted  since  Fx{B)t  coincides  a.s.  with  a 
measurable  function  of  (Br,  0  <  r  <  t),  by  (i),  and  since  the  filtration  (^t)  is 
complete.  On  the  other  hand,  by  the  construction  of  Fx  (and  because  Xx  —  Xx  a.s.), 
we  have,  for  every  t  >  0,  W(dw)  a.s. 

+  f  b(s,Fx(w)s)ds, 

Jo 

where  the  stochastic  integral  /Qr  a(s,  Fx(w)s)dw(s)  can  be  defined  by 


/' 

JO 


Fx(  w),  =  x  +  a(s,  Fx(w)s)  dw(s) 


2”*-l 


,  Fx(w)s)  dw(s)  =  lim  a( 

k — ^OO 


i= 0 


it 

2nk 


Fx(w)it/2"k)  (w( 


d  +  1  )t,  ,  it 

- )  —  w(  — 

ink  7  vO  nk 
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W(dw)  a.s.  Here  (tik)k>  1  is  a  suitable  subsequence,  and  we  used  Proposition  5.9.  We 
can  now  replace  w  by  B  (whose  distribution  is  W(dw) !)  and  get  a.s. 

2nk  —  1 

Ext 

o(—,Fx(B )  it/2nk)  (B(i+l)t/2nk  —  Bit/2"k)  + 

/v  ^  ^  z  k 

i= 0 

=  x+  [  a(s.  Fx(B)s)dBs  +  (  b{s,Fx(B)s)&s, 

Jo  Jo 

again  thanks  to  Proposition  5.9.  We  thus  obtain  that  FX(B )  is  the  desired  solution. 

We  still  have  to  prove  the  second  part  of  assertion  (iii).  We  again  fix  the  filtered 
probability  space  (£?,  (J^),P)  and  the  (J^)-Brownian  motion  B.  Let  U  be  an 

-measurable  random  variable.  If  in  the  stochastic  integral  equation  satisfied  by 
FX(B )  we  formally  substitute  U  for  x,  we  obtain  that  Fjj(B)  solves  E(a,b)  with 
initial  value  U.  However,  this  formal  substitution  is  not  so  easy  to  justify,  and  we 
will  argue  with  some  care. 

We  first  observe  that  the  mapping  (x,  co)  i->  Fx(B)t  is  continuous  with  respect  to 
the  variable  v  (if  co  is  fixed)  and  measurable  with  respect  to  co  (if  v  is  fixed).  It 
easily  follows  that  this  mapping  is  measurable  for  the  a-field  <^(R)  ®  FFt.  Since  U 
is  #o -measurable,  we  get  that  Fjj{B)t  is  ^-measurable.  Hence  the  process  Fjj{B) 
is  adapted.  For  v  e  R  and  w  e  C(R+,  R),  we  define  G(x,  w)  e  C(R+,  R)  by  the 
formula 


G(x,  w)t  =  f  b(s,Fx(w)s)ds. 
Jo 


We  also  set  H(x,  w)  =  Fx( w)  —  v  —  G(v,  w).  We  have  already  seen  that,  for  every 
v  G  R,  we  have  W(dw)  a.s., 


H(x,w)t=  /  cr^,  Fx(w)s)  dw(s). 
Jo 


Hence,  if 


2n— 1 

it  (i  +  1  )t  it 

Hn(x,  w),  =  y]  a(—  ,Fx(w)it/2n)  (w(l  ’  )  -  w(— )), 

i= 0 

Proposition  5.9  shows  that 


H(x,  w)r  =  lim  Hn(x,  w)(, 

n-^oo 

in  probability  under  W(dw),  for  every  v  e  R.  Using  the  fact  that  U  and  B  are 
independent  (because  U  is  -measurable),  we  infer  from  the  latter  convergence 
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that 


H(U ,  B),  =  lim  H„(U,  B), 

n— >oo 

in  probability.  Thanks  again  to  Proposition  5.9,  the  limit  must  be  the  stochastic 
integral 


,  Fjj(B)s)  d Bs. 


We  have  thus  proved  that 

[  cr(s,  Fu(B)s)  d Bs  =  H(U,  B)t  =  Fv{B)t  -U  -  [  b(s,  FV(B)S)  ds  , 

Jo  Jo 

which  shows  that  Fjj(B)  solves  E(p,  b)  with  initial  value  U.  □ 

A  consequence  of  Theorem  8.5,  especially  of  property  (ii)  in  this  theorem,  is  the 
continuity  of  solutions  with  respect  to  the  initial  value.  Given  the  filtered  probability 
space  (£2,  (  J^i) ,  P)  and  the  (J^)-Brownian  motion  B ,  one  can  construct,  for 
every  v  £  the  solution  Xx  of  Ex(o,  b)  in  such  a  way  that,  for  every  co  £  F2,  the 

mapping  v  i-^  Xx{cd)  is  continuous.  More  precisely,  the  arguments  of  the  previous 
proof  give,  for  every  s  £  (0,  1)  and  for  every  choice  of  the  constants  A  >  0  and 
T  >  0,  a  (random)  constant  C£  aj(oj)  such  that,  if  |jt|,  |y|  <  A, 

sup  I Xf((0)  -  Xf(a>)|  <  C£,A,r(ft>)  \x  -  j|1_£ 

t<T 

(in  fact  the  version  of  Kolmogorov’s  lemma  in  Theorem  2.9  gives  this  only  for 
d  —  1,  but  there  is  an  analogous  version  of  Kolmogorov’s  lemma  for  processes 
indexed  by  a  multidimensional  parameter,  see  [70,  Theorem  1.2.1]). 


8.3  Solutions  of  Stochastic  Differential  Equations  as  Markov 
Processes 

In  this  section,  we  consider  the  homogeneous  case  where  cr(fiy)  =  cr(y)  and 
b(t,y)  —  b(y).  As  in  the  previous  section,  we  assume  that  a  and  b  are  Lipschitz: 
There  exists  a  constant  K  such  that,  for  every  x,y  £  M.d, 

|cr(x)  —  cr(y)|  <  K\x  -  y\  ,  \b(x)-b(y)\  <  K\x-y\. 

Let  v  £  Rd,  and  letXx  be  a  solution  of  Ex((j,  b).  Since  weak  uniqueness  holds,  for 
every  t  >  0,  the  law  of  Xx  does  not  depend  on  the  choice  of  the  solution.  In  fact,  this 
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law  is  the  image  of  Wiener  measure  on  C(M+ ,  Rd)  under  the  mapping  w  i->  Fx( w)r, 
where  the  mappings  Fx  were  introduced  in  Theorem  8.5.  The  latter  theorem  also 
shows  that  the  law  of  Xf  depends  continuously  on  the  pair  (x,  t). 

Theorem  8.6  Assume  that  {Xt)t>o  is  a  solution  of  E(p,  b)  on  a  (complete)  filtered 
probability  space  (£2,  (dFt),P).  Then  (Xt)t>o  is  a  Markov  process  with  respect 

to  the  filtration  (#)),  with  semigroup  o  defined  by 


Qtfix)  =  E\f(Xxt)}, 


where  Xx  is  an  arbitrary  solution  ofEx((j,  b). 

Remark  With  the  notation  of  Theorem  8.5,  we  have  also 


f(Fx( w)()  W(dw). 


(8.2) 


Proof  We  first  verify  that,  for  any  bounded  measurable  function  /  on  Mf1,  and  for 
every  s,t  >  0,  we  have 


E\f(Xs+t)  I  &s]  =  Qtf(Xs ), 

where  Qf  is  defined  by  (8.2).  To  this  end,  we  fix  s  >  0  and  we  write,  for  every 
t  >  0, 


Xs+t  —  Xs  + 


o(Xr)  d Br  + 


(8.3) 


where  B  is  an  (J^) -Brownian  motion  starting  from  0.  We  then  set,  for  every  t  >  0, 


s+t , 


We  observe  that  the  filtration  (#"/)  is  complete  (of  course  =  ^oo),  that  the 
process  X'  is  adapted  to  (^/),  and  that  B'  is  an  ra-dimensional  (^/)-Brownian 
motion.  Furthermore,  using  the  approximation  results  for  the  stochastic  integral 
of  adapted  processes  with  continuous  sample  paths  (Proposition  5.9),  one  easily 
verifies  that,  a.s.  for  every  t  >  0, 

f  a  (Xr)dBr=  (  n(X'u)dB'u 
Js  JO 

where  the  stochastic  integral  in  the  right-hand  side  is  computed  in  the  filtration  (#/). 
It  follows  from  (8.3)  that 


d  u. 
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Hence  X'  solves  E(a,b ),  on  the  space  (X2,  (#"/),  P)  and  with  the  Brownian 

motion  B\  with  initial  value  X'0  —  Xs  (note  that  Xs  is  -measurable).  By  the  last 
assertion  of  Theorem  8.5,  we  must  have  X ’  —  FXs{B '),  a.s. 

Consequently,  for  every  t  >  0, 


mXs+d \&]  =  E\f(X[) \&s]  =  E\f(FXs(B')t) \ft]  =  j f(FXs( w),)  W( dw) 

=  Qtf(Xs), 


by  the  definition  of  Qf.  In  the  third  equality,  we  used  the  fact  that  B'  is  independent 
of  J^s.,  and  distributed  according  to  W(dw),  whereas  Xs  is  ^-measurable. 

We  still  have  to  verify  that  (Qt)t> o  is  a  transition  semigroup.  Properties  (i)  and 
(iii)  of  the  definition  are  immediate  (for  (iii),  we  use  the  fact  that  the  law  of  Xxt 
depends  continuously  on  the  pair  (x,  t)).  For  the  Chapman-Kolmogorov  relation, 
we  observe  that,  by  applying  the  preceding  considerations  to  Xx,  we  have,  for  every 
s,t  >  0, 


Qt+Sf(x)  =  E\f(Xxs+t)]  =  E[E\f(Xx+t) \&]\  =  E[Qf(Xl)}  =  j  Qs(x,dy)Qtf(y). 

This  completes  the  proof.  □ 

We  write  C2c  ( W /)  for  the  space  of  all  twice  continuously  differentiable  functions 
with  compact  support  on  W1. 

Theorem  8.7  The  semigroup  (Qt)t> o  is  Feller.  Furthermore,  its  generator  F  is  such 
that 


C^(Md)  C  D(L) 


and,  for  every  f  €  Ct2  (R'/), 


Lf(x)  = 


\  y^(CTCT*),;(x) 

i,j=  1 


d 

(x)  +  y]  bj(x) 

i=  1 


where  a*  denotes  the  transpose  of  the  matrix  a. 


Proof  For  the  sake  of  simplicity,  we  give  the  proof  only  in  the  case  when  o  and 
b  are  bounded.  We  fix/  e  Co(MJ)  and  we  first  verify  that  Qf  e  Co(MJ).  Since 
the  mappings  v  i-^  Fx( w)  are  continuous,  formula  (8.2)  and  dominated  convergence 
show  that  Qf  is  continuous.  Then,  since 

o(Xx)&Bs+  [  b(Xx)ds, 

Jo 
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and  cr  and  b  are  assumed  to  be  bounded,  we  get  the  existence  of  a  constant  C,  which 
does  not  depend  on  t,  x ,  such  that 

E[(Xxt  -  xf]  <  C(t  +  t2).  (8.4) 

Using  Markov’s  inequality,  we  have  thus,  for  every  t  >  0, 


sup  P( \Xxt 

xeRd 


X 


>  A)  — >  0. 


-oo 


Writing 


W(x)\  =  \E\f{Xx))\  <  \E\f(Xx)lm_AsA}]\  +  \[f\\P(\Xx-x\>A), 


we  get,  using  our  assumption/  e  Co(Rflf), 


lim  sup  | QJ (x) |  <  |[/||  sup  P(\Xxt  -  x\  >  A), 

xeRd 


x — ^oo 


and  thus,  since  A  was  arbitrary, 


lim  Q,f(x)  =  0, 

x— >oo 


which  completes  the  proof  of  the  property  Qf  e  Co(MJ). 

Let  us  show  similarly  that  Qf  — >f  when  t  ->  0.  For  every  s  >  0, 


sup  |  E\f{Xxt)\  -f{x)  |  <  sup  | fix)  —fiy)\  +  2|1/||  sup  P(\Xxt  -  x\  >  e) 


xe 


x,v€Rd,|x— y\<e 


xe 


However,  using  (8.4)  and  Markov’s  inequality  again,  we  get 


sup  P(\X*  —  x  >  s)  — >  0, 

X6^ 


hence 


lim  sup  112/ -/|| 

t-+ o 


lim  sup  (  sup  \E\f(Xf]  -f(x)  |)  <  sup  [ f(x)  -f(y)] 

t—^0  xeMd  x,/GR^,|x— /|<e 


eRd 


which  can  be  made  arbitrarily  close  to  0  by  taking  s  small. 

Let  us  prove  the  second  assertion  of  the  theorem.  Let  /  e  C^(MJ).  We  apply 
Ito’s  formula  to  /(Xf),  recalling  that,  if  X f  =  (Xf’1, . . .  ,Xxt'd),  we  have,  for  every 
i  G  { 1 , . . . ,  d }, 

m  nt  n  t 

Xf  =xt  +  J2  +  /  W)  ds. 

■  1  Jo  Jo 
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We  get 


m  =  f(x) + Mt + 


d  /»; 

u 


bi(Xf)^-(X*)ds  +  - 


dx; 


-  E  f 

2  h 


a2/ 


dxidxi' 


(Xx)d{XxfXxJ  ), 


where  M  is  a  continuous  local  martingale.  Moreover,  if  i,  i!  G 


W 


=  E CT« iVOarjV®  ds  =  d.v. 

7=1 

We  thus  see  that,  if  g  is  the  function  defined  by 


1  d  d2f  d  df 

8 (x)  =  -  E  (ctOh'M  a  a  ,  (x)  +  E^'W 


the  process 


Mt  —  f{Xx)  —  fix)  —  f  g(Xx)ds 

J  o 

is  a  continuous  local  martingale.  Since  /  and  g  are  bounded,  Proposition  4.7  (ii) 
shows  that  M  is  a  martingale.  It  now  follows  from  Theorem  6.14  that /  e  D(L)  and 
Lf  =  g.  n 

Corollary  8.8  Suppose  that  (Xt)t>o  solves  E(a,b)  on  a  filtered  probability  space 
(£2 ,  SX ,  P).  Then  (Xt)t>o  satisfies  the  strong  Markov  property:  If  T  is  a 

stopping  time  and  if(P  is  a  Borel  measurable  function  from  C(R+,  M.d)  into  R+, 

^[l{r<oo}^(^r+t»  t  >  0)  |  &t\  —  l{r<oo} 

where,  for  every  x  G  W1,  PA  denotes  the  law  on  C(R+,  Wl)  of  an  arbitrary  solution 
of  Ex(cr,  b). 

Proof  It  suffices  to  apply  Theorem  6.17.  Alternatively,  we  could  also  argue  in  a 
similar  manner  as  in  the  proof  of  Theorem  8.6,  letting  the  stopping  time  T  play  the 
same  role  as  the  deterministic  time  s  in  the  latter  proof,  and  using  the  strong  Markov 
property  of  Brownian  motion.  □ 

Markov  processes  with  continuous  sample  paths  that  are  obtained  as  solutions 
of  stochastic  differential  equations  are  sometimes  called  diffusion  processes  (certain 
authors  call  a  diffusion  process  any  strong  Markov  process  with  continuous  sample 
paths  in  W1  or  on  a  manifold).  Note  that,  even  in  the  Lipschitz  setting  considered 
here,  Theorem  8.7  does  not  completely  identify  the  generator  L,  but  only  its  action 
on  a  subset  of  the  domain  D(L ):  As  we  already  mentioned  in  Chap.  6,  it  is  often  very 
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difficult  to  give  a  complete  description  of  the  domain.  However,  in  many  instances, 
one  can  show  that  a  partial  knowledge  of  the  generator,  such  as  the  one  given  by 
Theorem  8.7,  suffices  to  characterize  the  law  of  the  process.  This  observation  is  at 
the  core  of  the  powerful  theory  of  martingale  problems,  which  is  developed  in  the 
classical  book  [77]  by  Stroock  and  Varadhan. 

At  least  when  restricted  to  C^(MJ),  the  generator  L  is  a  second  order  differential 
operator.  The  stochastic  differential  equation  E(a,  b )  allows  one  to  give  a  proba¬ 
bilistic  approach  (as  well  as  an  interpretation)  to  many  analytic  results  concerning 
this  differential  operator,  in  the  spirit  of  the  connections  between  Brownian  motion 
and  the  Laplace  operator  described  in  the  previous  chapter.  We  refer  to  Durrett 
[18,  Chapter  9]  and  Friedman  [26,  27]  for  more  about  links  between  stochastic 
differential  equations  and  partial  differential  equations.  These  connections  between 
probability  and  analysis  were  an  important  motivation  for  the  definition  and  study 
of  stochastic  differential  equations. 


8.4  A  Few  Examples  of  Stochastic  Differential  Equations 

In  this  section,  we  briefly  discuss  three  important  examples,  all  in  dimension  one. 
In  the  first  two  examples,  one  can  obtain  an  explicit  formula  for  the  solution,  which 
is  of  course  not  the  case  in  general. 


8.4.1  The  Ornstein-Uhlenbeck  Process 

Let  A  >  0.  The  (one-dimensional)  Ornstein-Uhlenbeck  process  is  the  solution  of 
the  stochastic  differential  equation 


dXt  =  d Bt  -  XXt  d t. 


This  equation  is  solved  by  applying  Ito’s  formula  to  eXtXt,  and  we  get 


Note  that  the  stochastic  integral  is  a  Wiener  integral  (the  integrand  is  deterministic), 
which  thus  belongs  to  the  Gaussian  space  generated  by  B. 
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First  consider  the  case  where  Xo  =  x  e  R.  By  the  previous  remark,  X  is  a  (non- 
centered)  Gaussian  process,  whose  mean  function  is  m(t )  =  E[Xt]  =  xo~Xt\  and 
whose  covariance  function  is  also  easy  to  compute: 

g—X\t—s\  _ A (t+s) 

K(s.  t)  =  cov(Xs,Xt)  =  - — - . 

It  is  also  interesting  to  consider  the  case  when  Xo  is  distributed  according  to 
yK(0,  In  that  case,  X  is  a  centered  Gaussian  process  with  covariance  function 


_ e-A|M 

2A 

Notice  that  this  is  a  stationary  covariance  function.  In  that  case,  the  Ornstein- 
Uhlenbeck  process  X  is  both  a  stationary  Gaussian  process  (indexed  by  R+)  and 
a  Markov  process. 


8.4.2  Geometric  Brownian  Motion 

Let  a  >  0  and  r  e  R.  The  geometric  Brownian  motion  with  parameters  a  and  r  is 
the  solution  of  the  stochastic  differential  equation 


dXt  =  oXt  d Bt  +  rXt  d  t. 


One  solves  this  equation  by  applying  Ito’s  formula  to  logX^  (say  in  the  case  where 
Xo  >  0),  and  it  follows  that: 

2 

X,  =  Xo  exp  ( oB ,  +  (r  -  y)/) . 

Note  in  particular  that,  if  the  initial  value  X0  is  (strictly)  positive,  the  solution 
remains  so  at  every  time  t  >  0.  Geometric  Brownian  motion  is  used  in  the  celebrated 
Black-Scholes  model  of  financial  mathematics.  The  reason  for  the  use  of  this 
process  comes  from  an  economic  assumption  of  independence  of  the  successive 
ratios 


xt2  -  X q  Xt3  ~  xt2 
xn  ’  xl2 


corresponding  to  disjoint  time  intervals:  From  the  explicit  formula  for  Xt,  we  see 
that  this  is  nothing  but  the  property  of  independence  of  increments  of  Brownian 
motion. 
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8.4.3  Bessel  Processes 

Let  m  >  0  be  a  real  number.  The  m- dimensional  squared  Bessel  process  is  the  real 
process  taking  nonnegative  values  that  solves  the  stochastic  differential  equation 

dX,  =  2y[x,diBt  +mdt .  (8.5) 

Notice  that  this  equation  does  not  fit  into  the  Lipschitz  setting  studied  in  this  chapter, 
because  the  function  a(v)  =  2^/x  is  not  Lipschitz  over  M+  (one  might  also  observe 
that  this  function  is  only  defined  on  M+  and  not  on  R,  but  this  is  a  minor  point 
because  one  can  replace  2^/x  by  2  yf jvf  and  check  a  posteriori  that  a  solution  starting 
from  a  nonnegative  value  stays  nonnegative).  However,  there  exist  (especially  in 
dimension  one)  criteria  weaker  than  our  Lipschitz  continuity  assumptions,  which 
apply  to  (8.5)  and  give  the  existence  and  pathwise  uniqueness  of  solutions  of  (8.5). 
See  in  particular  Exercise  8.14  for  a  criterion  of  pathwise  uniqueness  that  applies 
to  (8.5). 

One  of  the  main  reasons  for  studying  Bessel  processes  comes  from  the  following 
observation.  If  d  >  1  is  an  integer  and  /3  =  (/31 , . . . ,  /3d)  is  a  d-dimensional 
Brownian  motion,  an  application  of  Ito’s  formula  shows  that  the  process 

m2  =  V})2  +  -  +  (Pf)2 


is  a  d-dimensional  squared  Bessel  process:  See  Exercise  5.33.  Furthermore,  one  can 
also  check  that,  when  m  —  0,  the  process  (^ Xt)t>o  has  the  same  distribution  as 
Feller’s  branching  diffusion  discussed  at  the  end  of  Chap.  6  (see  Exercise  8.1 1). 

Suppose  from  now  on  that  m  >  0  and  Xo  —  x  >  0.  For  every  r  >  0,  set 
Tr  inf{f  >  0  :  Xt  =  r}.  If  r  >  x,  we  have  P(Tr  <  oo)  =  1.  To  get  this,  use  (8.5) 
to  see  that  XtATr  —  x  +  m(t  A  Tr )  +  YtATr ,  where  E[(YtAT/.)2]  <  Art.  By  Markov’s 
inequality,  P(YtATr  >  t 3^4)  — >  0  as  t  ->  oo,  and  if  we  assume  that  P(Tr  —  oo)  >  0 
the  preceding  expression  for  XtATr  gives  a  contradiction. 

Set,  for  every  t  E  [0,  7q), 


M  =  {  (X)1  ”  ifm  ±  2, 

1  |  log(X,)  if  m  —  2. 

It  follows  from  Ito’s  formula  that,  for  every  e  e  (0,v),  MtATs  is  a  continuous 
local  martingale.  This  continuous  local  martingale  is  bounded  over  the  time  interval 
[0  ,Te  A  Ta],  for  every  A  >  x,  and  an  application  of  the  optional  stopping  theorem 
(using  the  fact  that  Ta  <  oo  a.s.)  gives,  if  m  ^  2, 


228 


8  Stochastic  Differential  Equations 


and  if  m  —  2, 


P(Te  <  Ta)  = 


log  A  —  log* 
log  A  —  logs 


When  m  —  d  is  an  integer,  we  recover  the  formulas  of  Proposition  7.16. 

Let  us  finally  concentrate  on  the  case  m  >  2.  Letting  s  go  to  0  in  the  preceding 
formulas,  we  obtain  that  P(Tq  <  oo)  =  0.  If  we  let  A  tend  to  oo,  we  also  get  that 
P(TS  <  oo)  =  1  if  m  —  2  (as  we  already  noticed  in  Chap.  7)  and  P(T£  <  oo)  = 
(£ / x)(m/2)-1  if  m  >  2. 

It  then  follows  from  the  property  P(7o  <  oo)  =  0  that  the  process  Mt  is  well- 
defined  for  every  t  >  0  and  is  a  continuous  local  martingale.  When  m  >  2,  Mt 
takes  nonnegative  values  and  is  thus  a  supermartingale  (Proposition  4.7  (i)),  which 
converges  a.s.  as  t  ->  oo  (Proposition  3.19).  The  limit  must  be  0,  since  we  already 
noticed  that  P(TA  <  oo)  =  1  for  every  A  >  x,  and  we  conclude  that  Xt  converges 
a.s.  to  oo  as  t  ->  oo  when  m  >  2.  One  can  show  that  the  continuous  local  martingale 
Mt  is  not  a  (true)  martingale  (cf.  Question  8.  in  Exercise  5.33  in  the  case  m  —  3). 

Exercise  5.31  in  Chap.  5  gives  a  number  of  important  calculations  related  to 
squared  Bessel  processes.  We  refer  to  Chapter  XI  in  [70]  for  a  thorough  study  of 
this  class  of  processes. 

Remark  The  m- dimensional  Bessel  process  is  (of  course)  obtained  by  taking 
Yt  —  +JXt,  and,  when  m  —  d  is  a  positive  integer,  it  corresponds  to  the  norm  of  d- 
dimensional  Brownian  motion.  When  m  >  1,  the  process  Y  also  satisfies  a  stochastic 
differential  equation,  which  is  however  less  tractable  than  (8.5):  See  Exercise  8.13 
below. 


Exercises 

Exercise  8.9  ( Time  change  method)  We  consider  the  stochastic  differential  equa¬ 
tion 

E(<J,  0)  dXt  =  cr(Xt)dBt 

where  the  function  a  :  R  — >  R  is  continuous  and  there  exist  constants  £  >  0  and 
M  such  that  s  <  o  <  M. 

1.  In  this  question  and  the  next  one,  we  assume  that  X  solves  E((j,  0)  with  Xq>  =  x. 
We  set,  for  every  t  >  0, 

At  =  f  <j(Xs)2  d s  ,  xt  —  inf{s  >  0  :  As  >  t}. 

Jo 
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Justify  the  equalities 


=  f 


dr 


,  At  —  infi^  >  0 


0  a(Xr  )2  -  Jo  o(XT  )2 


■L 


dr 


>  t}. 


2.  Show  that  there  exists  a  real  Brownian  motion  /3  =  (Pt)t> o  started  from  v  such 
that,  a.s.  for  every  t  >  0, 


Xt  Pinf{s>0:fQ  <r(/3r)  2d r>t) 


3.  Show  that  weak  existence  and  weak  uniqueness  hold  for  E(a,  0).  (Hint:  For 
the  existence  part,  observe  that,  if  X  is  defined  from  a  Brownian  motion  /3  by 
the  formula  of  question  2.,  X  is  (in  an  appropriate  filtration)  a  continuous  local 
martingale  with  quadratic  variation  (X,  X)t  =  cr(2Q2ds.) 

Exercise  8.10  We  consider  the  stochastic  differential  equation 
E(cr,  b )  dXt  =  or(Xt)  d Bt  +  b(Xt)  d t 

where  the  functions  a,b  :  R  — >  R  are  bounded  and  continuous,  and  such  that 
fR  \b(x)\dx  <  oo  and  a  >  s  for  some  constant  e  >  0. 

1.  Let  X  be  a  solution  of  E(a,b).  Show  that  there  exists  a  monotone  increasing 
function  F  :  R  — »  R,  which  is  also  twice  continuously  differentiable,  such  that 
F(Xt )  is  a  martingale.  Give  an  explicit  formula  for  F  in  terms  of  a  and  b. 

2.  Show  that  the  process  Yt  =  F(Xt )  solves  a  stochastic  differential  equation  of  the 
form  dYt  —  o\Yt )  d£?,  with  a  function  cF  to  be  determined. 

3.  Using  the  result  of  the  preceding  exercise,  show  that  weak  existence  and  weak 
uniqueness  hold  for  E(cr,  b).  Show  that  path  wise  uniqueness  also  holds  if  a  is 
Lipschitz. 

Exercise  8.11  We  suppose  that,  for  every  v  e  R+,  one  can  construct  on  the  same 
filtered  probability  space  (X2,  (^t) ,  P)  a  process  Xx  taking  nonnegative  values, 

which  solves  the  stochastic  differential  equation 


(  dxt  =  V2 X,dB, 

(  X0  =  x 

and  that  the  processes  Xx  are  Markov  processes  with  values  in  M+ ,  with  the  same 
semigroup  (Qt)t>o,  with  respect  to  the  filtration  (J^).  (This  is,  of  course,  close  to 
Theorem  8.6,  which  however  cannot  be  applied  directly  because  the  function  \flx 
is  not  Lipschitz.) 

1.  We  fix  x  G  R+,  and  a  real  T  >  0.  We  set,  for  every  t  e  [0,  T\ 

(  \ 

Mt  —  exp  ( - — — - -  ) . 

l+A  (T-t)J 

Show  that  the  process  (MtAT)t> o  is  a  martingale. 
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2.  Show  that  (Qt)t>o  is  the  semigroup  of  Feller’s  branching  diffusion  (see  the  end 
of  Chap.  6). 


Exercise  8.12  We  consider  two  sequences  (<Jn)n>  t  and  (bn)n>  i  of  real  functions 
defined  on  R.  We  assume  that: 


(i) 

(ii) 


There  exists  a  constant  C  >  0  such  that 
n  >  1  and  v  G  R. 

There  exists  a  constant  K  >  0  such  that, 


crn(v)|  <  C  and  |Z?„(x)|  <  C  for  every 
for  every  n  >  1  and  xjet, 


aw(v)  —  cr„ 0 ) |  <  K\x  —  y\  ,  \bn(x)  -  bn(y) \  <  K\x-y\. 

Let  B  be  an  (J^) -Brownian  motion  and,  for  every  n  >  1,  let  X”  be  the  unique 
adapted  process  satisfying 

an(X")dBs  +  [  bn{Xns)ds. 

Jo 


1.  Let  T  >  0.  Show  that  there  exists  a  constant  A  >  0  such  that,  for  every  real 
M  >  0  and  for  every  n  >  1 , 


P 


A 

-  w 


2.  We  assume  that  the  sequences  (a„)  and  (bn)  converge  uniformly  on  every 
compact  subset  of  R  to  limiting  functions  denoted  by  a  and  b  respectively.  Justify 
the  existence  of  an  adapted  process  X  —  (Xt)t>o  with  continuous  sample  paths, 
such  that 


Xt=  f  a(Xs)d Bs+  [  b(Xs)ds , 
Jo  Jo 


then  show  that  there  exists  a  constant  A'  such  that,  for  every  real  M  >  0,  for  every 
t  e  [0,  T\  and  n  >  1, 


pt  a' 

sup(X"  —Xs)2  <  4(4  +  T)K2  /  £[(X'V'-Xv)2]d''  + 

i<t  Jo 


M2 


+4T^4  sup  (crw(v)  —  cr(v))2  +  T  sup  ( bn(x )  —  /?(v))2^ 


\x\<M 


x\<M 


3.  Infer  from  the  preceding  question  that 


lim  E 

n—>oo 


suP(x"  -  xsy 


L  s<T 


=  0. 
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Exercise  8.13  Let  /3  =  (/3t)t>o  be  an  (J^)-Brownian  motion  started  from  0.  We  fix 
two  real  parameters  a  and  r,  with  a  >  1/2  and  r  >  0.  For  every  integer  n  >  1  and 
every  x  g  M,  we  set 


A  n  . 


1.  Let  n  >  1.  Justify  the  existence  of  the  unique  semimartingale  that  solves  the 
equation 


Zf  =  r  +  ^  +  a  f  fn(Zns)  d s. 

Jo 

2.  We  set  Sn  =  inf{t  >  0  :  ZJ7  <  l/n}.  After  observing  that,  for  t  —  Sn  A  Sn-\-  ] , 

z"+,-z'=“I(AA)ds' 

show  that  ZJ?+1  =  Z”  for  every  t  e  [0,  A  S„+i],  a.s.  Infer  that  >  5n. 

3.  Let  g  be  a  twice  continuously  differentiable  function  on  R.  Show  that  the  process 

g(Zf)  -  g(r)  -  J  (ag  (Z")fi,(Z")  +  ^g"(Z^))ds 
is  a  continuous  local  martingale. 

4.  We  set  h(x)  —  xl~2a  for  every  v  >  0.  Show  that,  for  every  integer  n  >  1, 

h(Z”AS)  is  a  bounded  martingale.  Infer  that,  for  every  t  >  0,  <  t)  tends  to  0 

as  ft  -a-  oo,  and  consequently  -a-  oo  a.s.  as  n  ->  oo. 

5.  Infer  from  questions  2.  and  4.  that  there  exists  a  unique  positive  semimartingale 
Z  such  that,  for  every  t  >  0, 


„  A  dx 
Zt  —  r  -\-  pt  -\-  a  I  — . 

Jo  As 

6.  Let  d  >  3  and  let  P  be  a  d-dimensional  Brownian  motion  started  from  y  G 
MJ\{0}.  Show  that  T?  =  |Pr|  satisfies  the  stochastic  equation  in  question  5.  (with 
an  appropriate  choice  of  /3)  with  r  =  |y|  and  a  —  (d  —  1) / 2.  One  may  use  the 
results  of  Exercise  5.33. 

Exercise  8.14  ( Yamada-Watanabe  uniqueness  criterion)  The  goal  of  the  exercise 
is  to  get  pathwise  uniqueness  for  the  one-dimensional  stochastic  differential  equa¬ 
tion 


dXt  =  cr(Xt)d Bt  +  b(Xt)dt 
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when  the  functions  a  and  b  satisfy  the  conditions 

\a(x)  -<r(y)|  <  K  V|x-y|  ,  \b{x)  -b(y)\  <  K\x-y\  , 

for  every  xjgI,  with  a  constant  K  <  oo. 

1.  Preliminary  question.  Let  Z  be  a  semimartingale  such  that  ( Z,Z)t  —  f^hsds, 
where  0  <  hs  <  C  \ZS\,  with  a  constant  C  <  o o.  Show  that,  for  every  t  >  0, 


lim  nE  if  l{o<|zd<iMd(Z,Z),]  —  0. 


(Hint:  Observe  that,  for  every  n  >  1 , 


E  [  \ZS\  11{0<|zv|<i}  d(Z,  Z)s  <  C t  <  oo.) 

LJ  o  J 


2.  For  every  integer  n  >  1 ,  let  <pn  be  the  function  defined  on  R  by 


f  0  if  \x\  >  l//i, 

=  <  2n(l  —  nx)  if  0  <  v  <  1/n, 

[  2n(\  +  nv)  if  —  l/n  <  x  <  0. 

Also  write  Fn  for  the  unique  twice  continuously  differentiable  function  on  R 
such  that  Fn( 0)  =  F^(0)  =  0  and  E[[  =  cpn.  Note  that,  for  every  xeR,  one  has 
Fn(x)  — >  |*|  and  F'n(x)  — >  sgn(v)  :=  1{X>0}  -  1{X<0}  when  n  ->  oo. 

Let  X  and  X ’  be  two  solutions  of  E(cr,  b)  on  the  same  filtered  probability  space 
and  with  the  same  Brownian  motion  B.  Infer  from  question  1.  that 

lim  [  <pn (Xs  -  X')  d(X  —  X',X  —  X')s]  =  0. 
n-*oc  L  Jo  J 

3.  Let  T  be  a  stopping  time  such  that  the  semimartingale  XtAT  —  X'tAT  is  bounded. 
By  applying  Ito’s  formula  to  Fn(XtAT  —  X'tAT),  show  that 

ftAT 

E[\XtAT  -  X'tAT\]  =  E[\Xo  ~X'0\ )+E  /  (b(Xs)  -  b(X'))  sgn(Xs  -  X')  ds  . 

LJ  o  J 

4.  Using  Gronwall’s  lemma,  show  that,  if  Xo  =  Xf0,  one  has  Xt  =  X[  for  every  t  >  0, 
a.s. 
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Notes  and  Comments 

As  already  mentioned,  the  treatment  of  stochastic  differential  equations  motivated 
Ito’s  invention  of  stochastic  differential  equations.  For  further  reading  on  this 
topic,  the  reader  may  consult  the  classical  books  of  Ikeda  and  Watanabe  [43]  and 
Stroock  and  Varadhan  [77],  the  latter  studying  stochastic  differential  equations 
in  connection  with  martingale  problems.  0ksendal’s  book  [66]  emphasizes  the 
applications  of  stochastic  differential  equations  in  other  fields.  The  books  [26,  27] 
of  Friedman  focus  on  connections  with  partial  differential  equations.  We  have 
chosen  to  concentrate  on  the  Lipschitz  case,  where  the  main  results  of  existence 
and  uniqueness  were  already  obtained  by  Ito  [37,  38].  In  dimension  one,  the  criteria 
ensuring  pathwise  uniqueness  can  be  weakened  significantly  (see  in  particular 
Yamada  and  Watanabe  [83],  which  inspired  Exercise  8.14)  but  this  is  no  longer  the 
case  in  higher  dimensions.  Chapter  XI  of  [70]  contains  a  lot  of  information  about 
Bessel  processes. 


Chapter  9 

Local  Times 


In  this  chapter,  we  apply  stochastic  calculus  to  the  theory  of  local  times  of 
continuous  semimartingales.  Roughly  speaking,  the  local  time  at  level  a  of  a 
semimartingale  X  is  an  increasing  process  that  measures  the  “number  of  visits”  of 
X  at  level  a.  We  use  the  classical  Tanaka  formulas  to  construct  local  times  and  then 
to  study  their  regularity  properties  with  respect  to  the  space  variable.  We  show  how 
local  times  can  be  used  to  obtain  a  generalized  version  of  Ito’s  formula,  and  we 
establish  the  so-called  density  of  occupation  time  formula.  We  also  give  several 
approximations  of  local  times.  We  then  focus  on  the  case  of  Brownian  motion, 
where  we  state  the  classical  Trotter  theorem  as  a  corollary  of  our  results  for  general 
semimartingales,  and  we  derive  the  famous  Levy  theorem  identifying  the  law  of  the 
Brownian  local  time  process  at  level  0.  In  the  last  section,  we  use  Brownian  local 
times  to  prove  the  Kallianpur-Robbins  law  that  was  stated  at  the  end  of  Chap.  7. 
This  chapter  can  be  read  independently  of  Chaps.  6,  7  and  8,  except  for  the  last 
section  that  relies  on  Chap.  7. 


9.1  Tanaka’s  Formula  and  the  Definition  of  Local  Times 

Throughout  this  chapter,  we  argue  on  a  filtered  probability  space  (T2,  o,  P), 

and  the  filtration  (^t)t> o  is  assumed  to  be  complete.  Let  X  be  a  continuous 
semimartingale.  If  /  is  a  twice  continuously  differentiable  function  defined  on 
R,  Ito’s  formula  asserts  that  f(Xt)  is  still  a  continuous  semimartingale,  and 

f(Xt)=f(X 0)+  [  f'Q QdXs+]-  [  f"(Xs)  d(X,X)s. 

Jo  ^  Jo 

The  next  proposition  shows  that  this  formula  can  be  extended  to  the  case  when  /  is 
a  convex  function. 
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Proposition  9.1  Let  f  be  a  convex  function  on  R.  Thenf(Xt)  is  a  semimartingale, 
and,  more  precisely,  there  exists  an  increasing  process  A f  such  that,  for  every  t  >  0, 


fix,)  = 


fL(Xs)  dxs  +  Aft , 


where  f!_(x)  denotes  the  left- derivative  off  at  x. 

More  generally,  f(Xt)  is  a  semimartingale  iff  is  a  difference  of  convex  functions. 

Proof  Let  He  a  nonnegative  continuous  function  on  R  such  that  h(x)  —  0  if  v  £ 
[0,  1]  and  h(x)  dx  =  1.  For  every  integer  n  >  1,  set  hn(x)  —  nh(nx).  Define  a 
function  cpn  :  R  — >  R  by 


<Pn(x)  =  h„  *f(x)  =  /  hn(y)f(x-y)dy. 

Jr 

Then  it  is  elementary  to  verify  that  (pn  is  twice  continuously  differentiable  on  R, 
( p'n  —  hn  *f!_,  and  (pn(x)  — >  f(x),  <p'n(x)  — >  f_{x)  as  n  ->  oo,  for  every  v  e  R. 
Furthermore,  the  functions  <pn  are  also  convex,  so  that  (f[[  >  0. 

Let  X  =  M  +  V  be  the  canonical  decomposition  of  the  semimartingale  X ,  and 
consider  an  integer  K  >  1 .  Introduce  the  stopping  time 

Tk  :=  inf{t  >  0  :  |X,|  +  +  [  \dVs\  >  X}. 

Jo 

From  Ito’s  formula,  we  have 


ptATK  j  rtATx 

<Pn(XtATK )  =  (pn(Xo)  +  /  ^(X5)  dx,  +  -  /  d(M,  M)s.  (9.1) 

From  the  definition  of  7^,  we  have  (M,  M}^  <  X.  Noting  that  the  functions  (p'n  are 
uniformly  bounded  over  any  compact  interval,  we  get,  by  a  simple  application  of 
Proposition  5.8, 


m/kTk  rtATx 

/  ?/(V)dV  — ►  /  /L(X,)dX„ 

Jo  Jo 


in  probability.  For  every  t  >0,  set 


ptATfc 

a{k  :=f(XtATK)  -fix o)  -  /  /lOO  dX 

Jo 


(9.2) 


(9.3) 
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Since  (pn{X o)  — >/(X o)  and  <Pn(XtATK)  ^  f(Xt  at K)  as  n  ->  oo,  we  deduce  from  (9.2) 
and  (9.1)  that 

1  rtATjc 

-  <p'n(Xs)d(M,M)s  — ►  4L  (9.4) 

2  J0  n-*oo 

f  K 

in  probability.  By  (9.3),  the  process  (Ay  />o  has  continuous  sample  paths,  and 
A0  —  0.  Since  cp"  >  0,  it  follows  from  the  convergence  (9.4)  that  the  sample  paths 
of  (Afp  )t>o  are  also  nondecreasing.  Hence  A^K  is  an  increasing  process.  Finally, 
one  gets  from  (9.4)  that  a{,k  —  A^aTk  if  K  <  K' .  It  follows  that  there  exists  an 

increasing  process  N  such  that  Ay  =  AJtATR  for  every  t  >  0  and  K  >  1.  We  then 
get  the  formula  of  the  proposition  by  letting  K  ->  oo  in  (9.3).  □ 

Remark  Write  f+  for  the  right-derivative  of/.  An  argument  similar  to  the  preceding 
proof  shows  that  there  exists  an  increasing  process  A^  such  that 


/ 


kx, )  =/(x o)  +  /  ./:(.vv)d\\  +  ,\f. 


If  /  is  twice  continuously  differentiable,  A,  =  A,  =  d  f'Qf"(Xs)  d(A\  A")v.  In  general, 

~  f  f  ^ 

however,  we  may  have  At  ^  AJ? . 

The  previous  proposition  leads  to  an  easy  definition  of  the  local  times  of  a 
semimartingale.  For  every  v  e  R,  we  set  sgn(v)  :=  l{x>o}  —  l{x<o}  (the  fact  that 
we  define  sgn(0)  =  —  1  here  plays  a  significant  role). 

Proposition  9.2  Let  X  be  a  continuous  semimartingale  and  ael  There  exists  an 
increasing  process  (Lat(X))t> o  such  that  the  following  three  identities  hold: 


\Xt-a\  =  |X0- 


a 


+  [  sgn(Xs  —  a)  dX5  +  Lat  (X), 

Jo 


( Xt-ay  =  (Xo- 


a)+  +  f  l{Xs>a]dXs+L“(X), 

f  1 

(X,  -  a)~  =  (X0  -a)~-  /  l{Xs<fl} dXs  +  -Lat(X). 

J0  A 


(9.5) 

(9.6) 

(9.7) 


The  increasing  process  (L“(X)),> o  is  called  the  local  time  of  X  at  level  a. 
Furthermore,  for  every  stopping  time  T,  we  have  If(XT)  —  L"/  r(X). 
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We  will  refer  to  any  of  the  identities  (9.5),  (9.6),  (9.7)  as  Tanaka’s  formula. 

Proof  We  apply  Proposition  9.1  to  the  convex  function  f(x)  =  \x  —  a\,  noting  that 
f_{x)  —  sgn(v  —  a).  It  follows  from  Proposition  9.1  that  the  process  (L“(X))t> o 
defined  by 


Lat{X)  :=  \Xt-a 


\Xq  -  a 


—  a)  dXs 


is  an  increasing  process.  We  then  need  to  verify  that  (9.6)  and  (9.7)  also  hold.  To 
this  end,  we  apply  Proposition  9.1  to  the  convex  functions /(v)  =  (x  —  a)+  and 
f{x)  =  (x  —  a)~ .  It  follows  that  there  exist  two  increasing  processes  Aa,(+)  and 
such  that 


(X,  -  a)+  =  (X0  -a)++  f  l{Xs>a}  dXs  +  Aat’{+) 

Jo 


and 


(Xt  —  a)  —  (Xo  —  a)  —  f  1  {xs<a}  d Xs  +  At^  ^ 

Jo 


By  considering  the  difference  between  the  last  two  displays,  we  immediately 
get  that  AaJ+)  =  AaJ~] .  On  the  other  hand,  if  we  add  these  two  displays  and 
compare  with  (9.5),  we  get  A^+)  +  Aat^  ^  —  L^(X).  Hence  A^+)  =  Aat^  )  — 
\L“(X). 

The  last  assertion  immediately  follows  from  (9.5)  since  j^T  sgn(X5  —  a)  dXs  = 
sgn(Xj  —  a)  dXj  by  properties  of  the  stochastic  integral.  □ 

Let  us  state  the  key  property  of  local  times.  We  use  the  notation  d^(X)  for 
the  random  measure  associated  with  the  increasing  function  s  i->  L“S(X)  (i.e. 

fM  W  =  OT)- 

Proposition  9.3  Let  X  be  a  continuous  semimartingale  and  let  a  e  R.  Then  a.s.  the 
random  measure  d5L"(X)  is  supported  on  {s  >  0  :  Xs  =  a}. 

Proof  Set  Wt  =  \Xt  —  a\  and  note  that  (9.5)  gives  (W,W)t  —  {X,X)t  since 
| sgn(jc)  |  =  1  for  every  x  e  M.  By  applying  Ito’s  formula  to  (Wt)2,  we  get 


(X,  -  a)2  =  wf  =  ( X0  -a)2 +  2 


dXs  +  2 


a\dsLas(X)  +  (X,X)t. 
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Comparing  with  the  result  of  a  direct  application  of  Ito’s  formula  to  (Xt  —  a)2,  we 
get 


a 


d  sLaAX)  =  0, 


which  gives  the  desired  result.  □ 

Proposition  9.3  shows  that  the  function  t  i->  Lat{X)  may  only  increase  when 
Xt  —  a .  So  in  some  sense,  L“(X)  measures  the  “number  of  visits”  of  the  process  X 
at  level  a  before  time  t  (the  results  of  Sect.  9.3  give  rigorous  versions  of  this  intuitive 
statement).  This  also  justifies  the  name  “local  time”. 


9.2  Continuity  of  Local  Times  and  the  Generalized 
Ito  Formula 


We  consider  a  continuous  semimartingale  X  and  write  X  —  M  +  V  for  its  canonical 
decomposition.  Our  first  goal  is  to  study  the  continuity  of  the  local  times  of  X  with 
respect  to  the  space  variable  a. 

It  is  convenient  to  write  La(X )  for  the  random  continuous  function  (L“(X))t> 0, 
which  we  view  as  a  random  variable  with  values  in  the  space  C(R+,  R+).  As  usual, 
the  latter  space  is  equipped  with  the  topology  of  uniform  convergence  on  every 
compact  set. 

Theorem  9.4  The  process  (. La(X ),  aeR)  with  values  in  C(R+,  R+)  has  a  cadlag 
modification,  which  we  consider  from  now  on  and  for  which  we  keep  the  same 
notation  ( La(X),a  G  R).  Furthermore,  ifLa~(X )  =  (Lf~(X))t> o  denotes  the  left 
limit  ofb  — >  Lb(X )  at  a,  we  have  for  every  t  >  0, 

Lat{X)  -  La~{X)  =  2  [  l{Xs=a}  dVs.  (9.8) 

Jo 

In  particular,  ifX  is  a  continuous  local  martingale,  the  process  (Ifi(X))ae R,t>o  has 
jointly  continuous  sample  paths. 

The  proof  of  the  theorem  relies  on  Tanaka’s  formula  and  the  following  technical 
lemma. 

Lemma  9.5  Let  p  >  1.  There  exists  a  constant  Cp,  which  only  depends  on  p,  such 
that  for  every  a,  b  G  M  with  a  <  b,  we  have 


-{a<Xs<b}m,M)) 


< 


Cp(b-a)P(E[({M,M),)p/2]  +£[(  J  |dVs|)'’]). 
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For  every  a  G  R,  write  Ya  =  (!?),>  o  for  the  random  variable  with  values  in 
C(M+ ,  R)  defined  by 

Yt  =  f 

Jo 

The  process  (Ya,  a  e  R)  has  a  continuous  modification. 

Proof  Let  us  start  with  the  first  assertion.  It  is  enough  to  prove  that  the  stated  bound 
holds  when  a  —  —u  and  Z?  =  u  for  some  u  >  0  (then  take  u  —  (b  — a)/ 2  and  replace 
X  by  X  —  (b  +  a)/ 2).  Let /  be  the  unique  twice  continuously  differentiable  function 
such  that 


fix)  =  (2  -  -L)+ 

u 

and/(0)  =  /' ( 0 )  =  0.  Note  that  we  then  have  \f'{x)\  <  2m  for  every  rel.  Since 
f"  >  0  and  fix)  >  1  if  —  u  <  x  <  u,  we  have 


l{-«<xs<„}d(M;M>s  <  f  f  (Xs)d(M ,  M)s. 

Jo 


However,  by  Ito’s  formula, 


(9.9) 


l  ff(Xs)d(M,M)s 

Jo 


=  fiX,)  -fiX o)  - 


f\Xs)dXs. 


Recalling  that  \f\  <  2m,  we  have 


(9.10) 


E[\fiX<)  -/(^o)|p]  <  Guf  E[\X,  —  XoH 

<  (2m)p£[(|M,-M0|  +J  |dV,|)P 


< 


Cp(2uy  (E[i{M,M)t)p/2]  +£[(  [  |dK|)P]), 


using  the  Burkholder-Davis-Gundy  inequalities  (Theorem  5.16).  Here  and  below, 
Cp  stands  for  a  constant  that  depends  only  on  p ,  which  may  vary  from  line  to  line. 
Then, 
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We  have 


/v 


(V)  dVs 


<  {2u  f  E 


( I'  |dvJ)1 


and,  using  the  Burkholder-Davis-Gundy  inequalities  once  again, 


/'/ 


(V)  d  Ms 


p- 


<  CpE 


(  ff{Xs)2d{M,M)Jj 


p!  2l 


<  CJ2uY  E{({M,M),y/2] 


The  first  assertion  of  the  lemma  follows  by  combining  the  previous  bounds, 
using  (9.9)  and  (9.10). 

Let  us  turn  to  the  second  assertion.  We  fix  p  >  2.  By  the  Burkholder-Davis- 
Gundy  inequalities,  we  have  for  every  a  <  b  and  every  t  >  0, 


sup 

L  s<t 


Yb  -  Y 

s  s 


a 


<  CpE 


(  j  l{a<xs<b}d(M,  M)SJ/2 


(9.11) 


and  the  right-hand  side  can  be  estimated  from  the  first  assertion  of  the  lemma.  More 
precisely,  for  every  integer  n  >  1 ,  introduce  the  stopping  time 

Tn  :=  inf{f  >  0  :  (M, M)t  +  f  \dVs\  >  n}. 

Jo 

From  the  first  assertion  of  the  lemma  with  X  replaced  by  the  stopped  process  XTn , 
we  have,  for  every  t  >  0, 


r  A  Tn 


-{ a<Xs<b ] 


}d{M,M)s)P/2]  <  Cp(np!A  +  np/2)  (b  -  a)p/ 2 


Using  (9.1 1),  again  with  X  replaced  by  XTn ,  and  letting  t  ->  oo,  we  obtain 


sup  \Ybr  -  Y“ 


SATn 


L  ^>0 


<  Cp(np/4  +  npl2)  (b  -  a)p/2. 


Since  p  >  2,  we  see  that  we  can  apply  Kolmogorov’s  lemma  (Theorem  2.9)  to 
get  the  existence  of  a  continuous  modification  of  the  process  a  ->  (Y“aT  )s> o,  with 

values  in  C(R+,  R).  Write  (T^’fl)5>o  for  this  continuous  modification. 

Then,  if  1  <  n  <  m,  for  every  fixed  a ,  we  have  for  every  ^  >  0, 

a.s.  By  a  continuity  argument,  the  latter  equality  holds  simultaneously  for  every 
a  G  R  and  every  s  >  0,  outside  a  single  set  of  probability  zero.  It  follows  that  we 
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can  define  a  process  (Ya,a  e  R)  with  values  in  C(R+,  R),  with  continuous  sample 
paths,  such  that,  for  every  n  >  1,  Yg,a  —  Y“ATn  for  every  a  e  R  and  every  s  >  0, 
a.s.  The  process  ( Ya ,  a  e  R)  is  the  desired  continuous  modification.  □ 

Remark  By  applying  the  bound  of  Lemma  9.5  to  XTn  (with  Tn  as  in  the  previous 
proof)  and  letting  a  tend  to  b ,  we  get  that,  for  every  be  R, 

l{xs=b}  d{M,  M)s  —  0 

for  every  t  >  0,  a.s.  Consequently,  using  Proposition  4.12,  we  also  have 

l{Xs=b}d  Ms  =  0,  (9.12) 

for  every  t  >  0,  a.s. 

Proof  of  Theorem  9.4  With  a  slight  abuse  of  notation,  we  still  write  ( Ya,a  e  R) 
for  the  continuous  modification  obtained  in  the  second  assertion  of  Lemma  9.5.  We 
also  let  (. Za ,  a  e  R)  be  the  process  with  values  in  C(R+,  R)  defined  by 

4  =  fl{Xs>a}dVs. 

JO 

By  Tanaka’s  formula,  we  have  for  every  fixed  a  e  R, 

L"  =  lUx,  -a)+  -(X o  -  a)+  -  Y‘‘  -  Z?) ,  for  every  t  >  0,  a.s. 

The  right-hand  side  of  the  last  display  provides  the  desired  cadlag  modification. 
Indeed,  the  process 

a  h*  ((X,  -  a)+  -  (X0  -  a)+  -  Yf) 

V  / 1> 0 


has  continuous  sample  paths,  and  on  the  other  hand  the  process  a  i->  Za  has  cadlag 
sample  paths:  For  every  ao  e  R,  the  dominated  convergence  theorem  shows  that 


1  {Xs>a}  dVy 
1  {Xs>a}  dVs 


- > 

a^ao 


- > 

a\ao,a<ao 


1  {Xs>a0}  dVj, 
1{XS><10} 
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uniformly  on  every  compact  time  interval.  The  previous  display  also  shows  that  the 
jump  Za°  —  Za°~  is  given  by 


zr  -  zr~  =  -  [  hxs=ao}  dVs, 

Jo 

and  this  completes  the  proof  of  the  theorem.  □ 

From  now  on,  we  only  deal  with  the  cadlag  modification  of  local  times  obtained 
in  Theorem  9.4. 

Remark  To  illustrate  Theorem  9.4,  set  Wt  —  \Xt\,  which  is  also  a  semimartingale 
by  Tanaka’s  formula  (9.5).  By  (9.6)  applied  to  Wt,  we  have 

Wt  =  m+  =  |Xo|  +  f  1  { |xs | >o } (sgn (V) dXj  +  d L°S(X))  +  \l%W) 

Jo  2 

=  \Xo\+  f  sgn(X,)d xs+  f  \{x -0}dxs  +  l-L°t(W), 

Jo  Jo  2 

noting  that  /Qr  l{|xJ>o}dT?(X)  =  0  by  the  support  property  of  local  time  (Proposi¬ 
tion  9.3).  Comparing  the  resulting  formula  with  (9.5)  written  with  a  —  0,  we  get 

L°t{W)  =  2 L%X)  -  2  f  l{Xs=0}dXs  =  L%X)  +  L°~(X), 

Jo 

using  (9.8).  The  formula  Lft(W)  —  L®(X)  +  L®~(X)  is  a  special  case  of  the  more 

general  formula  L“(W)  =  L"  (X)  +  l\  a)  (X),  for  every  a  >  0,  which  is  easily 
deduced  from  Corollary  9.7  below.  We  note  that  the  support  property  of  local  time 
implies  L“(W)  —  0  for  every  a  <  0,  and  in  particular  L?t  (W)  —  0.  We  leave  it  as 
an  exercise  for  the  reader  to  verify  that  formula  (9.8)  applied  to  L°t{W)  —  L^~(W) 
gives  a  result  which  is  consistent  with  the  preceding  expression  for  L^(W). 

We  will  now  give  an  extension  of  Ito’s  formula  (in  the  case  where  it  is  applied 
to  a  function  of  a  single  semimartingale).  If  /  is  a  convex  function  on  R,  the  left 
derivative/^  is  a  left-continuous  monotone  nondecreasing  function,  and  there  exists 
a  unique  Radon  measure  f'(dy)  on  M+  such  that  —  f_(b )  — f_(a ),  for 

every  a  <  b.  One  can  also  interpret  f"  as  the  second  derivative  of  /  in  the  sense  of 
distributions.  Note  that  /"  (da)  —  f'(a)da  iff  is  twice  continuously  differentiable.  If 
/  is  now  a  difference  of  convex  functions,  that  is,/  =  f\  —f2  where  both/  and/2  are 
convex,  we  can  still  make  sense  of  f  f"  (dy)  cp(y)  —  f  /"  (dy)  cp(y)  —  f  ff(dy)  cp(y) 
for  any  bounded  measurable  function  (p  supported  on  a  compact  interval  of  R. 

The  next  theorem  identifies  the  increasing  process  At  that  appeared  in  Proposi¬ 
tion  9.1. 
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Theorem  9.6  (Generalized  Ito  formula)  Let  f  be  a  difference  of  convex  functions 
on  R.  Then,  for  every  t  >  0, 


fix,)  = 


f_(xs)  dXs  + 


Remark  By  Proposition  9.2  and  a  continuity  argument,  we  have 


Lat(X)  —  0  for  every  a  £ 


min  Xc,  max  Xs  , 

Lo  <s<t  0  <s<t  J 


a.s. 


and  furthermore  the  function  a  i->  Lat{X)  is  bounded.  Together  with  the  observations 
preceding  the  statement  of  the  theorem,  this  shows  that  the  integral  fR  LffX)f"(da) 
makes  sense. 

Proof  By  linearity,  it  suffices  to  treat  the  case  when  /  is  convex.  Furthermore,  by 
simple  “localization”  arguments,  we  can  assume  that/"  is  a  finite  measure  supported 
on  the  interval  [-K,  K]  for  some  K  >  0.  By  adding  an  affine  function  to/,  we  can 
also  assume  that /  =  0  on  (— oo,  —K\.  Then,  it  is  elementary  to  verify  that,  for  every 

V  G  R, 

fix)  =  f  ( x-a)+ f"  {da ), 

Jr 

and 

f'-(x)  =  I  l{a<x]f(da).  (9.13) 

Tanaka’s  formula  gives,  for  every  a  £  R, 


(X,  -  a)+  =  (Xo  -  a)+  +  Yf  +  Z“  +  -L“(X), 


,+ 


where  we  use  the  notation  of  the  proof  of  Theorem  9.4  (and  we  recall  that  (Ya,  a  e 
R)  stands  for  the  continuous  modification  obtained  in  Lemma  9.5).  We  can  integrate 
the  latter  equality  with  respect  to  the  finite  measure  f"  (da)  and  we  get 


f  Yff 


/ 


f(Xt )  =f(X o)  +  /  Ylf’(Aa)  +  /  Z^f  {da)  +  ^ 


(X)f(da). 


By  Fubini’s  theorem, 


9.2  Continuity  of  Local  Times  and  the  Generalized  ltd  Formula 
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So  the  proof  will  be  complete  if  we  can  also  verify  that 

J  Y‘‘  f" (da)  =  J  f- (V)  d Ms.  (9. 14) 

This  identity  should  be  viewed  as  a  kind  of  Fubini  theorem  involving  a  stochastic 
integral.  To  provide  a  rigorous  justification,  it  is  convenient  to  introduce  the  stopping 
times  Tn  inf{s  >  0  :  ( M,M)S  >  n },  for  every  n  >  1.  Recalling  (9.13),  we  see 
that  our  claim  (9.14)  will  follow  if  we  can  verify  that,  for  every  n  >  1,  we  have  a.s. 

f  ( J  "  l{Xs>a}dMsy"(da)  =  J  "  (  f  \{Xs>a)f'{da))dMs,  (9.15) 

where  in  the  left-hand  side  we  agree  that  we  consider  the  continuous  modification 
of  a  i->  /qAT"  l{zs>fl}dMj  provided  by  Lemma  9.5.  It  is  straightforward  to  verify  that 

the  left-hand  side  of  (9.15)  defines  a  martingale  m{  in  H2,  and  furthermore,  for  any 
other  martingale  N  in  H2, 


EW,N)  oo]  =  ElM^N^  =  E 


l,Xs>aid(M,N)sy"(da) 


l{Xs>a}f(da))d(M,N)s 


By  a  duality  argument  in  H2,  this  suffices  to  verify  that  M,  coincides  with  the 
martingale  of  H2  in  the  right-hand  side  of  (9. 15).  This  completes  the  proof.  □ 

The  following  corollary  is  even  more  important  than  the  preceding  theorem. 

Corollary  9.7  (Density  of  occupation  time  formula)  We  have  almost  surely,  for 
every  t  >  0  and  every  nonnegative  measurable  function  ip  on  R, 


[  <p(Xs)d(X,X)s=  [  (p(a)L“(X)da. 

Jo  Jr 


More  generally,  we  have  a.s.  for  any  nonnegative  measurable  function  F  on  M+  x  R, 


F(s,Xs)  d(X,X)s 


F(s,  a)  d sLas(X). 


Proof  Fix  t  >  0  and  consider  a  nonnegative  continuous  function  ip  on  R  with 
compact  support.  Let  /  be  a  twice  continuously  differentiable  function  on  R  such 
that  f"  —  ip.  Note  that /  is  convex  since  ip  >  0.  By  comparing  Ito’s  formula  applied 
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to  f(Xt)  and  the  formula  of  Theorem  9.6,  we  immediately  get  that  a.s. 

[  <p(Xs)d(X,X)s  =  [  <p(a)L?(X)da. 

Jo  Jr 

This  formula  holds  simultaneously  (outside  a  set  of  probability  zero)  for  every  t  >  0 
(by  a  continuity  argument)  and  for  every  function  <p  belonging  to  a  countable  dense 
subset  of  the  set  of  all  nonnegative  continuous  functions  on  R  with  compact  support. 
This  suffices  to  conclude  that  a.s.  for  every  t  >  0,  the  random  measure 

Ah>  f  lA(Xs)d(X,X)s 

Jo 


has  density  (L^(X))flGR  with  respect  to  Lebesgue  measure  on  R.  This  gives  the  first 
assertion  of  the  corollary.  It  follows  that  the  formula  in  the  second  assertion  holds 
when  F  is  of  the  type 


F(s,a)  =  ![„,„]  (s)  1  a  (a) 


where  0  <  u  <  v  and  A  is  a  Borel  subset  of  R.  Hence,  a.s.  the  a-finite  measures 


B  — > 


poo 

Jo 


lB(s,Xs)d(X,X): 


and 


B  — > 


p  p  oo 

da 

Jr  Jo 


lB(s,a)dsLa(X ) 


take  the  same  value  for  B  of  the  form  B  —  [u,v]  x  A,  and  this  implies  that  the  two 
measures  coincide.  □ 

If  X  =  M  +  V  is  a  continuous  semimartingale,  then  an  immediate  application  of 
the  density  of  occupation  time  formula  gives,  for  every  be  R, 


l  l{xs=b}d(M,M)s=  l  l{b}(a)L?(X)da  =  0. 
Jo  Jr 


This  property  has  already  been  derived  after  the  proof  of  Lemma  9.5.  On  the  other 
hand,  there  may  exist  values  of  b  such  that 


1  {xs=b}dVs  ^  0, 

and  these  values  of  b  correspond  to  discontinuities  of  the  local  time  with  respect  to 
the  space  variable,  as  shown  by  Theorem  9.4. 
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Corollary  9.8  IfX  is  of  the  form  Xt  —  Xq  +  Vt,  where  V  is  a  finite  variation  process , 
then  Lat{X)  —  0  for  all  a  e  R  and  t  >  0. 

Proof  From  the  density  of  occupation  time  formula  and  the  fact  that  (X,  X)  =  0, 
we  get  fR  cp(a)  Ifi(X)  da  —  0  for  any  nonnegative  measurable  function  and  the 
desired  result  follows.  □ 

Remark  We  could  have  derived  the  last  corollary  directly  from  Tanaka’s  formula. 


9.3  Approximations  of  Local  Times 

Our  first  approximation  result  is  an  easy  consequence  of  the  density  of  occupation 
time  formula. 

Proposition  9.9  Let  X  be  a  continuous  semimartingale.  Then  a.s.  for  every  a  e  R 
and  t  >  0, 


C'(X)  =  lim  -  [  LaSX,:Sa+f  ,d(X,X) 

£  Jo 


Proof  By  the  density  of  occupation  time  formula, 


1  rt  y  rci-\-s 

~  /  Va<Xs<a+e}  d{X,  X)s=~  g 
£  J 0  £  Ja 


(X)  db. 


and  the  result  follows  from  the  right-continuity  of  b  i->  Lb (X)  at  a  (Theorem  9.4). 


□ 


Remark  The  same  argument  gives 

lim  1  [  \{a-£<xs<a+e}  d(X,X)s  =  \  (Lat(X)  +  L“-(X)). 
s^O  Zs  Jo  2 

The  quantity  Ifi(X)  |  (Lat(X)  +  La~(X))  is  sometimes  called  the  symmetric 
local  time  of  the  semimartingale  X.  Note  that  the  density  of  occupation  time 
formula  remains  true  if  L“(X)  is  replaced  by  Lat{X)  (indeed,  Lat(X)  and  Lat(X) 
may  differ  in  at  most  countably  many  values  of  a).  The  generalized  Ito  formula 
(Theorem  9.6)  also  remains  true  if  Lat(X)  is  replaced  by  Ifi(X),  provided  the  left- 
derivative  f_  is  replaced  by  \if+  +  ff).  Similar  observations  apply  to  Tanaka’s 
formulas. 

As  a  consequence  of  the  preceding  proposition  and  Lemma  9.5,  we  derive  a 
useful  bound  on  moments  of  local  times. 
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Corollary  9.10  Let  p  >  1.  There  exists  a  constant  Cp  such  that,  for  any  continuous 
semimartingale  X  with  canonical  decomposition  X  —  M  +  V,  we  have  for  every 
a  e  R  and  t  >  0, 

<  cp 

Proof  This  readily  follows  from  the  bound  of  Lemma  9.5,  using  the  approximation 
of  Lf  (X)  in  Proposition  9.9  and  Fatou’s  lemma.  □ 

We  next  turn  to  the  upcrossing  approximation  of  local  time.  We  first  need  to 
introduce  some  notation.  We  let  X  be  a  continuous  semimartingale,  and  s  >  0. 
We  then  introduce  two  sequences  (pf)n>\  and  of  stopping  times,  which  are 

defined  inductively  by 

af  :=  inf {t  >  0  :  Xt  —  0}  ,  x\  inf {t  >  erf  \Xt  —  s) , 

and,  for  every  n  >  1 , 

<+i  :=  inf{>  >  r„£  :  X,  =  0} ,  r„£+1  :=  inf'r  >  aen  :  X,  =  s} . 

We  then  define  the  upcrossing  number  of  X  along  [0,  s]  before  time  t  by 

Nf(t)  —  Card {n  >  1  :  xEn  <  f}. 

This  notion  has  already  been  introduced  in  Sect.  3.3  with  a  slightly  different 
presentation. 

Proposition  9.11  We  have,  for  every  t  >  0, 

sNf(t)  — ► 

z 


(£[((M,M)f)p/2]+£[(^  |dV,|)P]) 


m  probability. 

Proof  To  simplify  notation,  we  write  L°s  instead  of  L°s  (X)  in  this  proof.  We  first  use 
Tanaka’s  formula  to  get,  for  every  n  >  1, 


(X 


TnAt 


) 


+ 


+ 


(X*'A  tV  = 


rrnAt 

JafiAt 


-{xv>o}dX^ 


TnAt 


We  sum  the  last  identity  over  all  n  >  1  to  get 


oo 


E  ((^a0+ 


(XCT|Af)+)  = 


ft  x  00 


■(0Vf,T=] 


■{Xs>0} 


j  OO 


«  =  1 


(9.16) 
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Note  that  there  are  only  finitely  many  values  of  n  such  that  r"  <  t,  and  that  the 
interversion  of  the  series  and  the  stochastic  integral  is  justified  by  approximating 
the  series  with  finite  sums  and  using  Proposition  5.8  (the  required  domination  is 
obvious  since  the  integrands  are  bounded  by  1). 

Consider  the  different  terms  in  (9.16).  Since  the  local  time  L°  does  not  increase 
on  intervals  of  the  type  [r®,  crf+1)  (nor  on  [0,  erf)),  we  have 

oo  oo 

-  <■«,)  =  2X-+1«.  -  4a.)  =  4 

n= 1  n=  1 

Then,  noting  that  (XzsAt  )+  -  (X<Ar)+  =  s  if  S  t ,  we  have 

oo 

S  ((X^aO+  -  (X*A,)+)  =  eiVfCO  +  M(e). 

«  =  1 


where  0  <  u(s)  <  e. 

From  (9.16)  and  the  last  two  displays,  the  result  of  the  proposition  will  follow  if 
we  can  verify  that 


oo 


f  (2^  1«.4] (J) ) l{xs>o}dX^  — >  0 

J°  n=  1 


in  probability.  This  is  again  a  consequence  of  Proposition  5.8,  since 


oo 


0  -  (  ^2 

n=  1 


{Xs>0}  <  l{0<Xs<e} 


and  l{o<xv<e}  — >  0  as  e  ->  0. 


□ 


9.4  The  Local  Time  of  Linear  Brownian  Motion 

Throughout  this  section,  (Bt)t> o  is  a  real  Brownian  motion  started  from  0  and  (J^r) 
is  the  (completed)  canonical  filtration  of  5. 

The  following  theorem,  which  is  known  as  Trotter’s  theorem,  is  essentially 
a  restatement  of  the  results  of  the  previous  sections  in  the  special  case  of  real 
Brownian  motion.  Still  the  importance  of  the  result  justifies  this  repetition.  We  write 
supp  (fi)  for  the  topological  support  of  a  finite  measure  /x  on  M+. 

Theorem  9.12  (Trotter)  There  exists  a  ( unique )  process  q,  whose 

sample  paths  are  continuous  functions  of  the  pair  (a,  t),  such  that,  for  every  fixed 
a  e  R,  (Lf(B))t>o  is  an  increasing  process,  and,  a.s.  for  every  t  >  0,  for  every 
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nonnegative  measurable  function  <p  on  R, 


Furthermore,  a.s.for  every  aeM, 

supp(d5Z^(Z?))  c  {s  >  0  :  Bs  —  a},  (9.17) 

and  this  inclusion  is  an  equality  with  probability  one  if  a  is  fixed. 

Proof  The  first  assertion  follows  by  applying  Theorem  9.4  and  Corollary  9.7  to 
X  —  B,  noting  that  (B,B)t  —  t.  We  have  already  seen  that  the  inclusion  (9.17) 
holds  with  probability  one  if  a  is  fixed,  hence  simultaneously  for  all  rationals,  a.s.  A 
continuity  argument  allows  us  to  get  that  (9.17)  holds  simultaneously  for  all  a  e  R 
outside  a  single  set  of  probability  zero.  Indeed,  suppose  that  for  some  a  e  R  and 
0  <  s  <  t  we  have  >  Las{B)  and  Br  a  for  every  re  [s,  t\.  Then  we  can  find 
a  rational  b  eR  sufficiently  close  to  a  such  that  the  same  properties  hold  when  a  is 
replaced  by  b ,  giving  a  contradiction. 

Finally,  let  us  verify  that  (9.17)  is  an  a.s.  equality  if  a  e  R  is  fixed.  So  let  us  fix 
a  e  R,  and  for  every  rational  q  >  0,  set 

Hq  inf {t  >  q  :  Bt  =  a). 

Our  claim  will  follow  if  we  can  verify  that  a.s.  for  every  e  >  0,  Lk+s(B)  >  LUB)- 

q  q 

Using  the  strong  Markov  property  at  time  Hq ,  it  suffices  to  prove  that,  if  B'  is  a  real 
Brownian  motion  started  from  a ,  we  have  Las(Bf)  >  0,  for  every  e  >  0,  a.s.  Clearly 
we  can  take  a  =  0.  We  then  observe  that  we  have 

L°e(B)  = 

by  an  easy  scaling  argument  (use  for  instance  the  approximations  of  the  previous 
section).  Also  P(L°{(B)  >  0)  >  0  since  E[L®(B)\  —  E[\B\ |]  by  Tanaka’s  formula.  An 
application  of  Blumenthal’s  zero-one  law  (Theorem  2.13)  to  the  event 

oo 

A  :=  y]\C-AB)  >  0}  =  lim  f  {L°2-„(B)  >  0} 

1  1  n'foo 


completes  the  proof.  □ 

Remark  Theorem  9.12  remains  true  with  a  similar  proof  for  an  arbitrary  (possibly 
random)  initial  value  Z?o- 

We  now  turn  to  distributional  properties  of  local  times  of  Brownian  motion. 


9.4  The  Local  Time  of  Linear  Brownian  Motion 
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Proposition  9.13  (i)  Let  a  e  M\{0}  and  Ta  inf{t  >  0  :  Bt  =  a).  Then  L(|  (B) 
has  an  exponential  distribution  with  mean  2\a\. 

(ii)  Let  a  >  0  and  Ua  inf{t  >  0  :  \Bt\  —  a}.  Then  L(Jj  (B)  has  an  exponential 
distribution  with  mean  a. 

Proof 

(i)  By  simple  scaling  and  symmetry  arguments,  it  is  enough  to  take  <2=1.  We  then 
observe  that  L0oo(B)  =  oo  a.s.  Indeed,  the  scaling  argument  of  the  preceding 
proof  shows  that  L(^X)(B)  has  the  same  distribution  as  A L(^Q(B),  for  any  A  >  0, 
and  we  have  also  seen  that  L^^B)  >  0  a.s.  Fix  s  >  0  and  set 

r  :=  inf{t  >  0  :  L°t{B)  >  s}, 

so  that  r  is  a  stopping  time  of  the  filtration  .  Furthermore,  Bx  —  0  by  the 
support  property  of  local  time.  By  the  strong  Markov  property, 

B  't  :=  Br+t 

is  a  Brownian  motion  started  from  0,  which  is  also  independent  of 
Proposition  9.9  gives,  for  every  t  >  0, 

L°t(B')  =  L°t+t(B )  -  i. 

On  the  event  {L^  ( B )  >  =  {r  <  T\},  we  thus  have 

4,  (B)  -  s  =  4,  — r  (B' )  -  4;  (B'), 

where  T[  :=  inf{t  >  0  :  B[  =  1}.  Since  the  event  {r  <  T\)  is  ^-measurable 
and  B'  is  independent  of  we  get  that  the  conditional  distribution  of  L®  (B)  — 
s  knowing  that  L®  ( B )  >  s  is  the  same  as  the  unconditional  distribution  of 
Lj  ( B ).  This  implies  that  the  distribution  of  L®  (B)  is  exponential. 

Finally,  Tanaka’s  formula  (9.6)  shows  that  ^E[L®aT J  =  E[(BtATl)+].  As  t  -> 
oo,  converges  to  ]  by  monotone  convergence  and  E[(Bt Ar1)+] 

converges  to  E[{BTx)+]  by  dominated  convergence,  since  0  <  (^Ari)+  <  1. 
This  shows  that  E[L®aTi\  =  2,  as  desired. 

(ii)  The  argument  is  exactly  similar.  We  now  use  Tanaka’s  formula  (9.5)  to  verify 

that  E[L°Ua  (Z?)]  —  a.  □ 

Remark  One  can  give  an  alternative  proof  of  the  proposition  using  stochastic 
calculus.  To  get  (ii),  for  instance,  use  Ito’s  formula  to  verify  that,  for  every  A  >  0, 

(1  +X\Bt\)  exp(— A 4(B)) 

is  a  continuous  local  martingale,  which  is  bounded  on  [0,  Ua\.  An  application  of 
the  optional  stopping  theorem  then  shows  that  £[exp(— AL^  (/?))]  =  (1  +  A a)~{ . 
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The  previous  proof  has  the  advantage  of  explaining  the  appearance  of  the  exponen¬ 
tial  distribution. 

For  every  t  >  0,  we  set 


St  :=  sup  Bs  ,  It 

0<s<t 


inf  Bs  . 

0<s<t 


Theorem  9.14  (Levy)  The  two  processes  (St,  St  —  Bt)t> o  and  (L®(B),  \Bt\)t>o  have 
the  same  distribution. 

Remark  By  an  obvious  symmetry  argument,  the  pair  (—It,  Bt  —  It)t> o  also  has  the 
same  distribution  as  (St,  St  —  Bt)t>  q. 

Proof  By  Tanaka’s  formula,  for  every  t  >  0, 


-h+L%B), 


(9.18) 


where 


fir  =  f  sgn(fij)  dBs. 

Jo 

Since  (ft,  /3)t  =  t,  Theorem  5.12  ensures  that  fi  is  a  real  Brownian  motion  started 
from  0.  We  then  claim  that,  for  every  t  >  0, 

L®(B)  —  swp{/3s  :  s  <  t}. 

The  fact  that  L®(B)  >  sup {/3S  :  s  <  t}  is  immediate  since  (9.18)  shows  that  L®(B)  > 
L°s  (B)  >  /3S  for  every  ^  G  [0,  t\.  To  get  the  reverse  inequality,  write  yt  for  the  last 
zero  of  B  before  time  t.  By  the  support  property  of  local  time,  L®(B)  =  Iy  (B ),  and 

using  (9.18),  Ly  (B)  =  fjyi  <  sup{^s  :  s  <  t}. 

We  have  thus  proved  a.s. 

|S/|)f>o  =  (sup {fis  :  s  <  t},  sup{fis :  s  <  t}  -  fi,),> 0, 

and  since  ( fis)s>o  and  (Bs)s> o  have  the  same  distribution,  the  pair  in  the  right-hand 
side  has  the  same  distribution  as  (St,  St  —Bt)t> q.  □ 

Theorem  9.14  has  several  interesting  consequences.  For  every  t  >  0,  St  has  the 
same  law  as  \Bt\  (Theorem  2.21),  and  thus  the  same  holds  for  L®(B).  From  the 
explicit  formula  (2.2)  for  the  density  of  (St,  Bt ),  we  also  get  the  density  of  the  pair 
(p(B),Bt). 

For  every  s  >  0,  set 


rs  inf {t  >  0  :  L°t(B)  >  51}. 
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The  process  (zs)s>o  is  called  the  inverse  local  time  (at  0)  of  the  Brownian  motion 
B.  By  construction,  (r5)5>o  has  cadlag  increasing  sample  paths.  From  Levy’s 
Theorem  9.14,  one  gets  that 


(*s)s>0  —  (Ts)s>  0, 


where,  for  every  s  >  0,  Ts  :  —  inf {t  >  0  :  Bt  >  s}  (it  is  easy  to  verify  that,  for  every 
s  >  0,  Ts  —  Ts  a.s.,  but  (Ts)s>o  has  cadlag  sample  paths,  which  is  not  the  case  for 
(Ts)s>0). 

The  same  application  of  the  strong  Markov  property  as  in  the  proof  of  Proposi¬ 
tion  9.13  shows  that  (r5)5>o  has  stationary  independent  increments  -  compare  with 
Exercise  2.26.  Furthermore,  using  the  invariance  of  Brownian  motion  under  scaling, 
we  have  for  every  A  >  0, 


( J-Xs)s>0  —  6^  Ts)s> 0* 


The  preceding  properties  can  be  summarized  by  saying  that  ( zs)s>o  is  a  stable 
subordinator  with  index  1  /2  (a  subordinator  is  a  Levy  process  with  nondecreasing 
sample  paths). 

The  interest  of  considering  the  process  (rs)s>o  comes  in  part  from  the  following 
proposition. 

Proposition  9.15  We  have  a.s. 

{t  >  0  :  Bt  —  0}  =  {zs  :  s  >  0}  U  {zs-  :  s  e  D} 

where  D  is  the  countable  set  of  jump  times  of  ( rs)s>o- 
Proof  We  know  from  (9.17)  that  a.s. 

supp(drL^(5))  c  {t  >  0  :  Bt  =  0}. 

It  follows  that  any  time  t  of  the  form  t  —  zs  or  t  —  zs-  must  belong  to  the  zero  set 
of  B.  Conversely,  recalling  that  (9.17)  is  an  a.s.  equality  for  a  —  0,  we  also  get  that, 
a.s.  for  every  t  such  that  Bt  —  0,  we  have  either  L°tJrE(B)  >  L®(B)  for  every  e  >  0, 
or,  if  t  >  o,  L°t  (B)  >  lft_s(B )  for  every  s  >  0  with  s  <  t  (or  both  simultaneously), 
which  implies  that  we  have  t  —  rLo^  or  t  —  rLo^_.  □ 

As  a  consequence  of  Proposition  9.15,  the  connected  components  of  the  comple¬ 
ment  of  the  zero  set  {t  >  0  :  Bt  —  0}  are  exactly  the  intervals  (r5_,  zs)  for  s  e  D. 
These  connected  components  are  called  the  excursion  intervals  (away  from  0).  For 
every  s  e  D,  the  associated  excursion  is  defined  by 

esif)  •—  B(rs—-\-t)/\T5  > 


t  >  0. 
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The  goal  of  excursion  theory  is  to  describe  the  distribution  of  the  excursion  process, 
that  is,  of  the  collection  ( es)seD .  This  study,  however,  goes  beyond  the  scope  of  the 
present  book,  see  in  particular  [4,  70,  72]. 


9.5  The  Kallianpur-Robbins  Law 


In  this  section,  we  use  local  times  to  give  a  short  proof  of  the  Kallianpur-Robbins 
law  for  planar  Brownian  motion,  which  was  stated  at  the  end  of  Chap.  7  as 
Theorem  7.23.  Let  us  recall  the  notation  we  need.  We  let  B  stand  for  a  complex 
Brownian  motion,  and  for  simplicity  we  assume  that  B$  —  1  (the  general  case  will 
then  follow,  for  instance  by  applying  the  strong  Markov  property  at  the  first  hitting 
time  of  the  unit  circle).  According  to  Theorem  7.19,  we  can  write  \Bt\  —  exp(/3#r), 
where  P  is  a  real  Brownian  motion  started  from  0,  and 


=  inf{s  >  0  :  f  exp(2/3M)  d u  >  t}. 

Jo 


For  every  A  >  0,  we  also  consider  the  scaled  Brownian  motion  P^tX)  —  Jj-  fix2 ?,  and 
for  t  >  1  we  use  the  notation  Xt  —  (log  t)/  2. 

We  aim  at  proving  that,  for  every  R  >  0, 


2  C 

: -  /  l{l#vl<^} 

log*  Jo 

converges  in  distribution  as  t  ->  oo  to  an  exponential  distribution  with  mean  R2.  To 
this  end,  we  write,  for  every  fixed  t  >  1 , 


log 


-r 

t  Jo 


{\Bs\<R} 


ds  = 


1  r 

J  1  !/>/;,  -  log  ;  d'"1' 


-u 


Ht 


=  t  /  l{/3„<iog«}  exp(2^„)dM 


=  A,  f 

Jo 


(a  ,r2H, 


^{^,)<(A,)-1log«} 


exp(2Af  PlJ'O  d  u 


/.(A,)-1  log  R 

—  A,  / 

7  —  00 


exp(2Ara)  Laa  r2Hi(P(X,))  da 
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In  the  second  last  equality,  we  applied  the  density  of  occupation  time  formula 
(Corollary  9.7)  to  the  Brownian  motion  and  in  the  last  one  we  used  the  change 
of  variables  r  —  eXtCl.  As  t  ->  oo,  (Ar)-1logr  — >  0,  for  every  r  >  0,  and 
Lemma  7.21  also  tells  us  that  (A t)~2Ht  —  T[Xt)  converges  in  probability  to  0,  with 

the  notation  =  infj.v  >  0  :  fi)  =1}-  From  the  joint  continuity  of  Brownian 
local  times  (Theorem  9.12),  we  then  get  that,  for  every  s  e  (0 ,R), 


sup 

s<r<R 


oo 


in  probability.  By  combining  this  with  the  previous  display,  we  obtain  that 


2 

log  t 


l{|Bs|<ff}  ds- 


in  probability.  To  complete  the  proof,  we  just  note  that  the  law  of  does 


not  depend  on  A  >  0,  and  is  exponential  with  mean  2,  by  Proposition  9.13. 


Remark  The  preceding  proof  shows  that  the  limiting  exponential  variable  in 
Theorem  7.23  does  not  depend  on  the  choice  of  R ,  in  the  sense  that  we  can  obtain  a 
joint  convergence  by  taking  several  values  of  R ,  with  the  same  exponential  variable 
in  the  limit,  up  to  multiplicative  constants.  This  can  also  be  deduced  from  the 
Chacon-Ornstein  ergodic  theorem,  which  implies  that  the  same  limit  in  distribution 
holds  more  generally  for  the  occupation  time  of  an  arbitrary  compact  subset  K  of 
C,  the  constant  R2  then  being  replaced  by  n~l  times  the  Lebesgue  measure  of  K. 
Our  method  of  proof  also  shows  that  the  convergence  in  the  Kallianpur-Robbins 
theorem  holds  jointly  with  that  of  windings  in  Spitzer’s  theorem  (Theorem  7.20) 
and  the  limiting  joint  distribution  is  the  law  of  (/3),  y^,),  where  j3  and  y  are 

independent  real  Brownian  motions  started  from  0,  and  T\  —  inf{7  >0:^  =  1}. 


Exercises 

Exercise  9.16  Let /  :  R  — >  R  be  a  monotone  increasing  function,  and  assume  that 
/  is  a  difference  of  convex  functions.  Let  X  be  a  semimartingale  and  consider  the 
semimartingale  Yt  —  f(Xt).  Prove  that,  for  every  a  e  R, 

Lat(Y)  =4(fl)L?(X),  La~(Y)  —  f!_  (a)  La~  (X) . 

In  particular,  if  X  is  a  Brownian  motion,  the  local  times  of  f(X )  are  continuous  in 
the  space  variable  if  and  only  if  /  is  continuously  differentiable. 
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Exercise  9.17  Let  M  be  a  continuous  local  martingale  such  that  (M,  M)^  —  oo 
a.s.,  and  let  B  be  the  Brownian  motion  associated  with  M  via  the  Dambis-Dubins- 
Schwarz  theorem  (Theorem  5.13).  Prove  that,  a.s.  for  every  a  >  0  and  t  >  0, 

Exercise  9.18  Let  X  be  a  continuous  semimartingale,  and  assume  that  X  can  be 
written  in  the  form 


where  B  is  a  Brownian  motion  and  a  and  Z?  are  progressive  and  locally  bounded. 
Assume  that  a(co,s)  ^  0  for  Lebesgue  a.e.  s  >  0,  a.s.  Show  that  the  local  times 
L“(X)  are  jointly  continuous  in  the  pair  (a,t). 

Exercise  9.19  Let  X  be  a  continuous  semimartingale.  Show  that  the  property 

supp(d5Z^(X))  c  {s  >  0  :  Xs  =  a) 

holds  simultaneously  for  all  a  eR,  outside  a  single  set  of  probability  zero. 

Exercise  9.20  Let  B  be  a  Brownian  motion  started  from  0.  Show  that  a.s.  there 
exists  an  a  e  R  such  that  the  inclusion  supp(d SL“(B))  C  {s  >  0  :  Bs  —  a)  is  not  an 
equality.  (Hint:  Consider  the  maximal  value  of  B  over  [0, 1].) 

Exercise  9.21  Let  B  be  a  Brownian  motion  started  from  0.  Note  that 

oo 

l{Bt>o}ds  =  oo 


a.s.  and  set,  for  every  t  >  0, 

At  =  f  l{B,>o}d^  ,  a,  =  inf f.y  >  0  :  As  >  t}. 

Jo 


1.  Verify  that  the  process 


not 

Yt=  l{Bs>o}dS.s 

Jo 

is  a  Brownian  motion  in  an  appropriate  filtration. 

2.  Show  that  the  process  At  —  L®  (B)  has  nondecreasing  and  continuous  sample 
paths,  and  that  the  support  of  the  measure  d^  As  is  contained  in  {s  :  B0s  —  0}. 

3.  Show  that  the  process  (B0t)t> o  has  the  same  distribution  as  (I^IX>o. 
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Exercise  9.22  (Skew  Brownian  motion )  Let  a,  f)  >0  and  consider  the  function 
g(v)  =  ot l{x>o}  —  /31{x<o}.  Let  X  be  a  continuous  semimartingale  such  that 

f  g(Xs)dBs ,  (9.19) 

Jo 


where  5  is  a  Brownian  motion. 

1.  Set  <p(x)  =  \  *  l{x>o}  -  jfX  1{^<o},  and  Y,  =  <p(Xt).  Prove  that  Y,  =  <p(Xt)  solves 
the  equation 


Y,  =  Bt+X-{\-°t)L%Y) 

(use  the  result  of  Exercise  9.16). 

2.  Compute  Z^(L)  —  L®~(Y)  in  terms  of  Z^(y),  in  two  different  ways. 

3.  Starting  from  a  Brownian  motion  /3  with  /3q  =  0,  set 


07  =  inf {s  >  0  :  As  >  t). 


Verify  that  the  process  Xt  —  satisfies  the  equation  (9.19)  in  an  appropriate 
filtration  and  with  an  appropriate  Brownian  motion  B. 

Exercise  9.23  Let  g  :  R  — >  R  be  a  real  integrable  function  (fR  |g(x)|dx  <  oo). 
Let  B  be  a  Brownian  motion  started  from  0,  and  set 


At  =  f  g(Bs)ds. 

Jo 

1.  Justify  the  fact  that  the  integral  defining  At  makes  sense,  and  verify  that,  for  every 
c  >  0  and  every  u  >  0,  Aciu  has  the  same  distribution  as 


2.  Prove  that 


where  N  is  yK(0, 1). 

Exercise  9.24  Let  a  and  b  be  two  locally  bounded  measurable  functions  on  M+  xR, 
and  consider  the  stochastic  differential  equation 


E(<t,  b ) 


dXt  =  cr(t,  Xt)  d Bt  +  b(t,  Xt)  d t. 
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Let  X  and  X '  be  two  solutions  of  E(o,  b ),  on  the  same  filtered  probability  space  and 


with  the  same  Brownian  motion  B. 

1.  Suppose  that  L°t(X  —  Xr)  —  0  for  every  t  >  0.  Show  that  both  Xt  v  X[  and  Xt  A  X[ 
are  also  solutions  of  E(cr,  b).  (Hint:  Write  Xt  v  X't  —  Xt  +  (X[  —  X?)+,  and  use 
Tanaka’s  formula.) 

2.  Suppose  that  <j(t,x )  =  1  for  every  t,x.  Show  that  the  assumption  in  question  1. 
holds  automatically.  Suppose  in  addition  that  weak  uniqueness  holds  for  E(o,  b). 
Show  that,  if  Xo  =  X^  —  x  e  R,  the  two  processes  X  and  X ’  are  indistinguishable. 

Exercise  9.25  (Another  look  at  the  Yamada-Watanabe  criterion )  Let  p  be  a  nonde¬ 
creasing  function  from  [0,  oo)  into  [0,  oo)  such  that,  for  every  s  >  0, 


Consider  then  the  one-dimensional  stochastic  differential  equation 


E(o,  b) 


dXt  =  a(Xt)dBt  +  b(Xt)dt 


where  one  assumes  that  the  functions  o  and  b  satisfy  the  conditions 


(a(x)  -cr{y))2  <  p(|x-y|)  ,  \b{x)  -b(y)\  <K\x-y\  , 


for  every  x,y  e  R,  with  a  constant  K  <  oo.  Our  goal  is  use  local  times  to  give 
a  short  proof  of  path  wise  uniqueness  for  E(o,  b)  (this  is  slightly  stronger  than  the 
result  of  Exercise  8.14). 

1.  Let  Y  be  a  continuous  semimartingale  such  that,  for  every  t  >  0, 


a.s. 


Prove  that  L®(Y)  —  0  for  every  t  >  0,  a.s. 


2.  Let  X  and  X ’  be  two  solutions  of  E(o,  b)  on  the  same  filtered  probability  space 
and  with  the  same  Brownian  motion  B.  By  applying  question  1.  to  Y  —  X  —  X', 
prove  that  L®(X  —  X')  —  0  for  every  t  >  0,  a.s.,  and  therefore, 


3.  Using  Gromwall’s  lemma,  prove  that  if  Xq  =  X'0  then  Xt  —  X't  for  every  t  >  0, 


a.s. 
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Notes  and  Comments 

The  local  time  of  Brownian  motion  was  first  discussed  by  Levy  [54]  under  the 
name  “mesure  du  voisinage”.  In  1958,  Trotter  [80]  established  the  joint  continuity 
of  Brownian  local  times  viewed  as  densities  of  the  occupation  measure.  Tanaka  [79] 
obtained  the  formulas  of  Proposition  9.2  in  the  Brownian  case.  The  local  time  of 
semimartingales  was  discussed  by  Meyer  [61],  who  derived  Theorem  9.6  in  this 
general  setting  (after  the  earlier  work  of  Tanaka  [79]  in  the  Brownian  setting). 
Yor  [84]  then  developed  the  powerful  approach  that  leads  to  Theorem  9.4.  The 
upcrossing  approximation  of  local  time  (Proposition  9. 1 1)  is  due  to  Ito  and  McKean 
[42,  Chapter  2]  for  Brownian  motion,  and  was  extended  to  semimartingales  by  El 
Karoui  [22] .  Other  approximation  results  for  the  Brownian  local  time  were  obtained 
by  Levy  [54]  (see  [42]  and  [70]).  Theorem  9.14  is  essentially  due  to  Levy  [54],  but 
our  proof  is  based  on  an  argument  from  Skorokhod  [74]  (see  [23]  for  a  related  study 
of  the  so-called  “reflection  problem”  in  the  semimartingale  setting).  Lor  further 
properties  of  local  times,  the  reader  may  consult  Chapter  V  of  [70]  or  Chapter  6 
of  [49],  as  well  as  the  classical  book  [42]  of  Ito  and  McKean.  Local  times  are 
also  a  key  ingredient  of  excursion  theory,  which  is  treated  in  the  general  setting  of 
Markov  processes  in  Blumenthal’s  book  [4].  Exercise  9.22  deals  with  the  singular 
stochastic  equation  first  studied  by  Harrison  and  Shepp  [31],  whose  solution  is  the 
so-called  skew  Brownian  motion.  Exercise  9.23  gives  the  one-dimensional  version 
of  the  Kallianpur-Robbins  law,  which  can  also  be  found  in  [48].  Exercise  9.25  is 
from  [51]. 
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Appendix  A1 

The  Monotone  Class  Lemma 


The  monotone  class  lemma  is  a  tool  of  measure  theory  which  is  very  useful  in 
several  arguments  of  probability  theory.  We  give  here  the  version  of  this  lemma  that 
is  used  in  several  places  in  this  volume. 

Let  E  be  an  arbitrary  set,  and  let  &(E)  denote  the  set  of  all  subsets  of  E.  If 
^  C  &(E),  o(fo)  stands  for  the  smallest  a -field  on  E  containing  ^  (it  is  also  the 
intersection  of  all  a -fields  containing  c€). 

Definition  A  subset  of  &(E)  is  called  a  monotone  class  if  the  following 
properties  hold: 

(i)  E  £  . 

(ii)  If  A,B  £  M  and  A  C  B,  then  B\A  £  M  . 

(iii)  If  (An)n> o  is  an  increasing  sequence  of  subsets  of  E  such  that  An  £  for 

every  n  >  0,  then  |^J  An  £  . 

n>  0 

A  a-field  is  a  monotone  class.  As  in  the  case  of  a-fields,  one  immediately 
checks  that  the  intersection  of  an  arbitrary  collection  of  monotone  classes  is  again  a 
monotone  class.  If  ^  is  an  arbitrary  subset  of  &(E),  the  monotone  class  generated 
by  ^ ,  which  is  denoted  by  is  defined  by  setting 

jh&)  =  Pi  .//. 

monotone  class, 


Monotone  class  lemma  If  lo  C  2?(E)  is  stable  under  finite  intersections,  then 

Proof  Since  any  a-field  is  a  monotone  class,  it  is  clear  that  c  a(^).  To 

prove  the  reverse  inclusion,  it  is  enough  to  verify  that  is  a  a-field.  However, 

a  monotone  class  is  a  a-field  if  and  only  if  it  is  stable  under  finite  intersections 
(indeed,  considering  the  complementary  sets  shows  that  it  is  then  stable  under  finite 
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unions,  and  via  an  increasing  passage  to  the  limit  one  gets  that  it  is  also  stable  under 
countable  unions).  Let  us  show  that  ^ZZiftf)  is  stable  under  finite  intersections. 

For  every  A  G  &(E),  set 

^Ha  —  {B  £  Jt&)  :AHB  e 

Fix  A  G  Z?.  Since  Z?  is  stable  under  finite  intersections,  we  have  Z?  C  jZZa-  Let  us 
verify  that  is  a  monotone  class: 

•  E  e  is  trivial. 

•  If  B ,  B'  g  jiZA  and  B  C  B' ,  we  have  A  D  (B'\B)  =  (A  D  B/)\(A  D  B)  G  ^ZZifio) 
and  thus  B'\B  g  ^ZZa- 

•  If  Bn  g  jZZa  for  every  n  >  0  and  the  sequence  {Bn)n> o  is  increasing,  we  have 
A  fl  (U Bn)  =  U(A  D  Bn )  G  jii{f€)  and  therefore  U Bn  G  jZZa- 

Since  Ma  is  a  monotone  class  that  contains  <J£a  also  contains  ^ZZifio).  We  have 
thus  obtained  that 


VA  g^,  V5g  Jtifg),  a  n  B  g  Jt(fg). 

This  is  not  yet  the  desired  result,  but  we  can  use  the  same  idea  another  time. 
Precisely,  we  now  fix  A  G  ^ZZifio).  According  to  the  first  part  of  the  proof,  Z?  C  jZZa- 
By  exactly  the  same  arguments  as  in  the  first  part  of  the  proof,  we  get  that  <Ma  is 
a  monotone  class.  It  follows  that  ^ZZifio)  c  sHa,  which  shows  that  .JZifio)  is  stable 
under  finite  intersections,  and  completes  the  proof.  □ 

Here  are  a  few  consequences  of  the  monotone  class  lemma  that  are  used  above. 

1.  Let  si  be  a  a-field  on  E ,  and  let  /x  and  v  be  two  probability  measures  on 

( E ,  si).  Assume  that  there  exists  a  class  Z?  c  si ,  which  is  stable  under  finite 
intersections,  such  that  oif€)  =  si  and  /x(A)  =  v(A)  for  every  A  G  Z".  Then 
fi  —  v.  ( Use  the  fact  that  Z?  {A  G  si  :  /x(A)  =  v(A)}  is  a  monotone  class.) 

2.  Let  (Xi)isi  be  an  arbitrary  collection  of  random  variables,  and  let  be  a  a-field 
on  the  same  probability  space.  In  order  to  show  that  the  a-fields  a(W,  i  g  I)  and 

are  independent,  it  is  enough  to  verify  that  (Xi{ , ,  X;  )  is  independent  of  , 
for  any  choice  of  the  finite  set  {q, . . . ,  ip}  C  I.  ( Observe  that  the  class  of  all 
events  that  depend  on  a  finite  number  of  the  variables  W,  i  G  I,  is  stable  under 
finite  intersections  and  generates  a (X/,  i  G  /).) 

3.  Let  ( Xi)isi  be  an  arbitrary  collection  of  random  variables,  and  let  Z  be  a  bounded 
real  variable.  Let  z’o  £  E  In  order  to  verify  that  E[Z\Xt,i  G  I]  =  E[Z\  X/0],  it 
is  enough  to  show  that  E[Z  \  Xi(} ,  X^ , ... ,  X;  ]  =  E[Z  \  X[0]  for  any  choice  of  the 
finite  collection  {q , . . . ,  ip}  c  /.  ( Observe  that  the  class  of  all  events  A  such  that 
E[1aZ ]  =  E[1a  E[Z  |  Yio]]  is  a  monotone  class.) 

This  last  consequence  of  the  monotone  class  lemma  is  useful  in  the  theory  of 
Markov  processes. 


Appendix  A2 

Discrete  Martingales 


In  this  appendix,  we  recall  without  proof  the  results  about  discrete  time  martingales 
and  supermartingales  that  are  used  in  Chap.  3.  The  proof  of  the  subsequent 
statements  can  be  found  in  Neveu’s  book  [65],  and  in  many  other  books  dealing 
with  discrete  time  martingales  (see  in  particular  Williams  [82]  and  Chapter  XII  in 
Grimmett  and  Stirzaker  [30]). 

We  use  the  notation  N  =  {0, 1, 2, . . .}.  Let  us  start  by  recalling  the  basing 
definitions.  We  consider  a  probability  space  (I 2,^,P ),  and  we  fix  a  discrete 
filtration,  that  is,  an  increasing  sequence  (&n)ne n  of  sub-a-fields  of  & .  We  also  let 

oo 

&oo  =  \J% 

n= 0 


be  the  smallest  a-field  that  contains  all  the  a-fields  &n. 

Definition  A  sequence  (Tw)„e^  of  integrable  random  variables,  such  that  Yn  is 
^-measurable  for  every  n  e  N,  is  called 

•  a  martingale  if,  whenever  0  <  m  <  n,  E[Yn  |  £fm]  =  Ym\ 

•  a  supermartingale  if,  whenever  0  <  m  <  n,  E[Yn  |  £fm]  <  Ym\ 

•  a  submartingale  if,  whenever  0  <  m  <  n,  E[Yn  |  cgm]  > 

Y m  • 

All  these  notions  obviously  depend  on  the  choice  of  the  filtration  (%)neN,  which 
is  fixed  in  what  follows. 

Maximal  inequality  If  (Yn)ne ^  is  a  supermartingale,  then,  for  every  A  >  0  and 
every  k  G  N, 


<  E[\Y0\]  +  2E[\Yk\]. 
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Doob’s  inequality  in  LP  If(Yn)ne n  is  a  martingale ,  then,  for  every  k  G  N  and  every 
p  >  1, 


E 


sup 


L  0 <n<k 


Remark  This  inequality  is  interesting  only  if  £[|l^|p]  <  oo,  since  otherwise  both 
sides  are  infinite. 

If  y  =  (yn)neN  is  a  sequence  of  real  numbers,  and  a  <  h ,  the  upcrossing  number 
of  this  sequence  along  [a,  b]  before  time  n,  denoted  by  Myab(n ),  is  the  largest  integer 
k  such  that  there  exists  a  strictly  increasing  finite  sequence 


m\  <n\  <  m2  <  n2  <  -  •  -  <  nik  <  nk 


of  nonnegative  integers  smaller  than  or  equal  to  n  with  the  properties  ym  <  a  and 
ym  >  b ,  for  every  i  e  {1, . . .  ,k}.  In  what  follows  we  consider  a  sequence  Y  = 
(Yn)ne n  of  real  random  variables,  and  the  associated  upcrossing  number  MYab(n )  is 
then  an  integer- valued  random  variable. 

Doob’s  upcrossing  inequality  If  (Yn)ne ^  is  a  supermartingale,  then,  for  every  n  e 
N  and  every  a  <  b, 


E[MYab(n)}  <  f—E[(Yn  -  a)-\. 


This  inequality  is  a  crucial  tool  for  proving  the  convergence  theorems  for 
discrete-time  martingales  and  supermartingales.  Let  us  recall  two  important 
instances  of  these  theorems. 

Convergence  theorem  for  discrete-time  supermartingales  If  (Yn)ne n  is  a  super¬ 
martingale,  and  if  the  sequence  ( Yn)ne ^  is  bounded  in  Ll,  then  there  exists  a  random 
variable  Y ^  G  L1  such  that 


Convergence  theorem  for  uniformly  integrable  discrete-time  martingales  Let 

(Yn)ne n  be  a  martingale.  The  following  are  equivalent: 

(i)  The  martingale  ( Yn)ne ^  is  closed,  in  the  sense  that  there  exists  a  random 
variable  Z  e  L[  (£2,  ZZ ,  P )  such  that  Yn  —  E[Z  \  9?n]  for  every  n  e  N. 

(ii)  The  sequence  (Yn)ne^  converges  a.s.  and  in  Ll. 

(iii)  The  sequence  (Yn)ne^  is  uniformly  integrable. 

If  these  properties  hold,  the  a.s.  limit  of  the  sequence  (Yn)ne?q  is  =  E[Z  \  %o]- 
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We  now  recall  two  versions  of  the  optional  stopping  theorem  in  discrete  time. 
A  (discrete)  stopping  time  is  a  random  variable  T  with  values  in  N  U  {oo},  such 
that  {T  —  n}  G  for  every  n  G  N.  The  a-field  of  the  past  before  T  is  then 
—  {A  g  :  A  fl  {T  —  n)  G  &n,  for  every  n  g  N}. 

Optional  stopping  theorem  for  uniformly  integrable  discrete-time  martingales 

Let  (Yn)ne n  be  a  uniformly  integrable  martingale,  and  let  Too  be  the  a.s.  limit  ofYn 
when  n  —>  oo.  Then,  for  every  choice  of  the  stopping  times  S  and  T  such  that  S  <  T, 
we  have  Yt  G  L1  and 


Ys  =  E[Yt  |  &s\ 


with  the  convention  that  Yj  —  Y ^  on  the  event  {T  =  oo},  and  similarly  for  Ys. 

Optional  stopping  theorem  for  discrete-time  supermartingales  (bounded  case) 

If(Yn)ne  n  A  a  supermartingale,  then,  for  every  choice  of  the  bounded  stopping  times 
S  and  T  such  that  S  <  T,  we  have 


Ys  >  E[Yt  |  %]. 


We  conclude  with  a  variant  of  the  convergence  theorem  for  supermartingales  in 
the  backward  case.  We  consider  a  backward  filtration,  that  is,  an  increasing  sequence 
of  filtrations  indexed  by  negative  integers  (in  such  a  way  that  the  a-field 

dffn  is  “smaller  and  smaller”  when  n  ->  — oo).  A  sequence  (T„)„G_^  of  integrable 
random  variables  indexed  by  negative  integers  is  called  a  backward  supermartingale 
if,  for  every  n  G  — N,  Yn  is  ^-measurable  and,  for  every  m  <  n  <  0, 
E[Yn  I  ^rm]  <  Ym. 


Convergence  theorem  for  backward  discrete- time  supermartingales  If  (Yn)ne-^ 
is  a  backward  supermartingale,  and  if  the  sequence  (Tn)«€-N  A  bounded  in  Ll,  then 
the  sequence  (Yn)n€-^  converges  a.s.  and  in  Lx  when  n  ->  —  oo. 


It  is  crucial  for  the  applications  developed  in  Chap.  3  that  the  convergence  also 
holds  in  L1  in  the  backward  case  (compare  with  the  analogous  result  in  the  “forward” 
case). 
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